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Introductory. 

I  UNDEUSTAND  by  a  factor  of  evolution  any  source  of  progressive  change  in  the 
constants — mean  values,  variabiHties,  correlations — which  suffice  to  define  an  organ 
or  character,  or  the  interrelations  of  a  group  of  organs  or  characters,  at  any  stage  in 
any  form  of  life.  To  demonstrate  the  existence  of  such  a  factor  v^e  require  to  show 
more  than  the  plausibility  of  its  effectiveness,  we  need  that  a  numerical  measure  of 
the  changes  in  the  organic  constants  shall  be  obtained  from  actual  statistical  data. 
These  data  must  be  of  sufficient  extent  to  render  the  numerical  determinations  large 
as  compared  with  their  probable  errors. 

In  a  "  Note  on  Reproductive  Selection,"  published  in  the  '  Roy.  Soc.  Proc.,'  vol.  59, 
p.  301,  I  have  pointed  out  that  if  fertility  be  inherited  or  if  it  be  correlated  with 
any  inherited  character — those  who  are  thoroughly  conversant  with  the  theory  of 
correlation  will  recognise  that  these  two  thinjjjs  are  not  the  same — then  we  have  a 
source  of  progressive  change,  a  vera  causa  of  evolution.  I  then  termed  this  factor 
of  evolution  Repi'oductive  Selection.  As  the  term  has  been  objected  to,  I  have 
adopted  Genetic  Selection  as  an  alternative.  I  mean  by  this  term  the  influence  of 
different  grades  of  reproductivity  in  producing  change  in  the  predominant  type. 

If  there  be  two  organs  A  and  both  correlated  with  fertility,  but  not  necessarily 
correlated  with  each  other,*  then  genetic  or  reproductive  selection  may  ultimately 
cause  the  predominance  in  the  population  of  two  groups,  in  which  the  organs 
A  and  B  are  widely  different  from  their  primitive  types — '  widely  different,'  because 
reproductive  selection  is  a  source  of  progressive  change.  Thus  this  form  of  selection 
can  be  a  source,  not  only  of  change,  but  of  differential  change.  As  this  differentia- 
tion is  progressive,  it  may  amount  in  time  to  that  degree  of  divergence  at  which 
crossing  between  the  two  groups  begins  to  be  difficult  or  distasteful.  We  then 
reach  in  genetic  or  reproductive  selection  a  source  of  the  origin  of  species. 

VHien  I  assert  that  genetic  (reproductive)  selection  is  a  factor  of  evolution,  I  do 
not  intend  at  present  to  dogmatise  as  to  the  amount  it  is  playing  or  has  played  in 
evolution.  I  intend  to  isolate  it  so  far  as  possible  from  all  other  factors,  and  then 
measure  its  intensity  numerically.  If  this  be  sensible,  then  the  demonstration  that 
it  is  a  factor  is  complete.  How  far  it  may  be  held  in  check  by  other  factors— 
e.g.,  natural  or  sexual  selection — is  a  matter  for  further  inquiiy.  If  three  forces, 
Ti,  F2,  F3  hold  a  system  sensibly  in  equilibrium,  then  Fi  cannot  be  asserted  to  be 
non-effective  because  no  progressive  change  is  visible ;  its  absence  would  soon  bring 
to  light  its  effectiveness. 

The  manner  in  which  genetic  (reproductive)  selection  is  to  some  extent  held  in 
check  will  be  clearer  when  my  memoir  on  the  influence  of  directed  selection  on 

*  If  r„(,  be  the  correlation  of  two  organic  characters  A  and  B,  and  C  be  a  third  character,  there  is  a 
considerable  range  of  values  of  r„  and  r^,,  for  which  r^j  may  be  zero  (see  Yule,  '  Roy.  Soc.  Proc' 
vol.  60,  p.  486). 
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variation  and  correlation  is  published.  Meanwhile  Mr.  Filon  and  I  have  shown 
that  even  a  random  selection  of  one  organ  alters  the  whole  system  of  correlated 
organs.*  Hence  genetic  (reproductive)  selection  indirectly  modifies  not  only  organs 
A  and  B,  but  all  correlated  organs.  These  modifications  must  be  consistent  with 
the  maintenance  of  stamina,  physique  and  fitness  to  the  environment,  if  the  change 
is  not  to  be  counteracted  by  natural  selection. 

So  far  as  man  is  concerned,  I  have  shownt  that  in  the  case  of  civilised  man,  the 
selective  death-rate — i.e.,  natural  selection — does  not  appear  to  counteract  repro- 
ductive selection.  A.  small  element  of  the  population  produces  the  larger  part  of  the 
following  generation.  I  thus  concluded  that  if  fertility  were  inherited,  reproductive 
selection  was  not  only  a  factor  of  evolution,  but  in  civilised  man  a  very  sensible 
factor,  i.e.,  an  apparently  incompletely  balanced  factor. 

In  the  three  years  which  have  intervened  since  writing  the  essay  just  referred  to, 
members  of  the  Department  of  Applied  Mathematics  in  University  College,  as  well 
as  other  friends,  have  occupied  their  spare  time  in  the  collection  of  data  as  to  fertility 
and  fecundity  in  the  cases  of  man  and  of  the  thoroughbred  racehorse.  About 
16,000  extracts  were  made  in  the  case  of  man,  and  more  than  7000  in  the  case  of 
thoroughbred  racehorses.  In  the  course  of  the  work,  which  proved  far  more  laborious 
than  we  had  anticipated,  many  difficulties  and  pitfalls  appeared.  But  as  a  general 
conclusion  it  seems  certain  that :  Both  fertility  and  fecundity  are  inherited,  and 
prohahly  in  the  manner  prescribed  by  the  Law  of  Ancestral  Heredity  ^ 

The  object  of  this  memoir  is  to  set  forth  the  theory  and  data  by  aid  of  which 
this  conclusion  was  reached.  It  will  be  seen  that  it  completes  the  establishment 
of  genetic  or  reproductive  selection  as  a  factor  of  evolution  by  determining  the  much 
disputed  point  as  to  whether  fertility  is  or  is  not  inherited. 

I.  Theory  of  Genetic  or  Reproductive  Selection.    By  Karl  Pearson,  F.R.S. 

(1.)  While  the  physical  result  of  fertility  in  an  individual  is  measurable,  the 
quality  of  fertility  or  fecundity  in  an  individual  differs  from  other  physical  characters 
in  that  it  does  not  allow  of  direct  measurements  except  when  the  potentiality  is 
exerted  and  the  effects  recorded.  At  present  we  are  not  able  to  measure  any  series 
of  organs  or  characters  in  individuals  and  so  ascertain  their  fertility  or  fecundity. 
At  the  same  time  there  is  little  doubt  that  these  characters  are  functions  of  the 
physical  and  measurable  organs  and  characters  of  the  body.  Such  organs  and 
characters  we  have  good  ground  for  supposing  to  be  inherited  according  to  the  Law 

*  "  Contributions  to  the  Theory  of  Evolution.— IV.  On  the  Influence  of  Random  Selection  on  Variation 
and  Correlation,"  'Phil.  Trans.,'  A,  vol.  191,  p.  234  et  seq. 

t  "  The  Chances  of  Death  and  other  Studies  in  Evolution.    Reproductive  Selection,"  vol.  1,  p.  63. 
X  See  '  Roy.  Soc.  Proc.,'  vol.  62,  p.  386. 
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of  Ancestral  Heredity.  It  seems  therefore  worth  while  to  prove  the  following 
proposition  : 

F^'oposition  I. — Antj  character  not  itself  directly  measwxible,  hut  a  function  of 
phydcally  measurahle  characters  and  organs  inhei'ited  according  to  the  Laiv  oj 
Ancestral  Heredity,  ivill  itself  he  inherited  according  to  that  laiv. 

Thus  if  we  assume  intellectual  and  emotional  characters  to  be  ultimately  a  result 
of  physical  conformation,  we  may  be  fairly  certain  that  although  we  know  neither 
the  organs  of  which  they  are  a  function,  nor  the  nature  of  that  function,  still  they 
v,'ill  be  inherited  according  to  the  same  law  as  that  which  holds  for  physically 
measurable  oro-ans. 

Let  y  be  the  character  in  a  parent,  and  let  it  be  an  unknown  function  f  of  the 
unknown  physical  organs  x^,  Xo,  x^,  .  .  .       or  let  : 

y=f{^i,  ^2,  ^3:  '  ■  •  a?™)  (i.). 

Let  A?/ denote  the  deviation  from  the  mean  value  of  the  character  y  in  some  special 
individaal,  and  Ax  the  deviation  from  the  mean  of  any  x  organ  in  the  same  individual. 
Then  if  these  deviations  be  small  compared  with  the  mean  values  of  the  organs 
considered,  we  have  from  (i.)  above  : 

Ay  =  «!  AXi  +  ao  AXo  +  a^  Ax^  +  .  .  .  (ii.), 

where  a^,  a,  .  .  .  are  constants  independent  of  the  individual  variations. 

Let  cr  denote  a  standard  deviation,  p  a  coefficient  of  interorganic  correlation,  S  a 
summation  with  regard  to  all  individuals  with  character  y  dealt  with,  and  let  them 
be  n  in  number.    Then  : 

na-l  =  S  {Ayf  =  S  {a^  AXi  +  «2  AXa  +  0^3  Ax^  -\-  .  .  .f 

=  n  {a\cr\  +  ala-X  +  a^a-X  +  .  .  .  +  •2c(iC»2  0-.,^o-^^p^^,,.^  +  Mia^a^^a-.^p,.,^,^  +  ...); 

or 

o-^  =  S(«!a-?,)  +  2S(«i«2  0-,,,o-^^/D,.,J  (iii.), 

where  S  denotes  a  summation  ihrough  the  group  of  m  organs. 

Let  y'  denote  the  character  in  an  individual  who  is  the  offspring  of  the  individual 
of  character  y,  and  x\,  x'o,  x\  .  .  .  the  corresponding  organs.  Then,  if  we  do  not 
suppose  the  nature  of  the  function  /  to  have  changed  in  a  single  generation,  we 
have  : 

y    —  f        1}  ^  2)       3  •  •  •  ^  m)  > 

and 

Ay'  =  «!  Ax  I  +  tto  Ax'2  +     ^^'3  +  (i^-)' 

o-;,  =  t{ala-l^)  +  2%{a,a2(T,.^a;.^p,,^,,,)   ......  (v.). 

Let  r  be  a  coefficient  of  direct  heredity  expressing  the  correlation  between  parent 
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and  offspring,  and  according  to  the  Law  of  Ancestral  Heredity  the  same  for  all 
organs.    Then  multiplying  (ii.)  and  (iv.)  together  and  summing  we  have  : 

?^c^yC^yR  =  S  {Ay Ay')  =  ^(ctiS  {ax^Ax^))  -}-  X  (ciiCuS  {Ax^Ax'o  +  A.TaAx'i)), 

where  R  is  the  coefficient  of  correlation  between  the  characters  y  and  y'  in  parent  and 
offspring.    Now  : 

&  (AXiAx'i)  —  7ia;.^(r^,^r 

S{AXiAx'2  +  Ax^Ax'i)  =  no-^.^o-.y^r^^^,^  +  no-y^cr^r^,^^^, 

where  and  r^,^^^  are  what  I  have  elsewhere  termed  coefficients  of  cross- heredity. 
Now  if  the  race  be  stable  or  sensibly  stable  for  two  generations  we  shall  have  for  all 
organs  0-5/  =  cr^,.    Hence  : 

S  {AXiAx'i)  =  naj.^  X  r 

S  {Ax.Ax^  +  Ax.,Ax\)  =  no-^^o-.,^(r^^,.^  +  r,,^^^)  =  ncr^^a-,^  X  2rp,^,^, 

for  it  is  shown  in  my  memoir  on  the  Law  of  Ancestral  Heredity  "  that  on  a  probable 
hypothesis  : 

Thus  we  find  on  substitution  : 

o-^o-2,E  =  r{t{a:i(T%)  +  2S(ai«2cr,,.o-^.^/),,^,,)). 

But  (iii.)  and  (iv.)  show  us  that  cr^  =  o-y,  if  there  be  no  sensible  changes  in  a 
generation.    Hence : 

and 

R  =  r. 

Thus  the  character  which  is  a  function  of  physical  organs  is  inherited  at  the  same 
rate  as  those  organs  themselves. 

As  we  may  not  unreasonably  consider  fertility  and  fecundity  to  be  functions  of 
physically  measurable  organs,  even  if  we  cannot  specify  which  organs,  we  may, 
d  priori,  expect  fertility  and  fecundity  to  be  inherited  characters. 

(2.)  Proposition  II. — To  determine  the  numerical  values  of  the  changes  in  mean 
variation  and  correlation  if  fertility  he  inherited. 

Let  us  first  define  two  terms  which  will  be  frequently  used  in  the  sequel. 

(a.)  The  fertility  of  an  individual  shall  be  defined  as  the  total  number  of  actual 
offspring. 

*  'Roy.  Soc.  Proc.,'  vol.  62,  p.  411.  The  hypothesis  yet  awaits  an  experimenfal  verification.  The 
need  to  use  it  prevents  Proposition  1.  being  self-evident. 
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(h.)  The  fecundity  of  an  individual  shall  be  defined  as  the  ratio  of  the  total  nnmber 
of  actual  offspring  to  the  total  number  of  offspring  which  might  have  come  into 
existence  under  the  circumstances. 

These  definitions  ai^e  not  intended  to  give  precise  statistical  measures  at  this  stage 
of  our  investigations.  They  are  merely  meant  to  convey  a  general  sense  of  the 
words,  which  will  be  more  precisely  limited  when  they  are  applied  to  any  given  species. 
Fertility  and  fecundity,  as  we  have  thus  defined  them,  leave  out  of  account  individual 
conditions  and  definite  conditions  of  period,  age  and  environment,  which  must  be 
fully  stated  before  numerical  measures  can  be  made  in  any  special  case.  When  the 
words  are  used  in  this  theoretical  section  the  reader  must  suppose  the  phrase,  "under 
definite  individual  and  environmental  conditions,"  to  be  always  inserted. 

Let  Ml  =  the  mean  fertility  of  parents  of  one  sex  ;  M'l  =  the  mean  fertility  of 
parents  of  one  sex- weighted  with  their  fertility  N,  the  number  of  parents  con- 
sidered in  the  first  case,  N',  the  apparent  number  dealt  with  in  the  second  case ; 
let  (Ti  and  rr'i  be  the  standard  deviations  in  the  two  cases,  and  let  x  represent  the 
fertility  of  an  individual  parent  and  z  its  frequency  among  Ni  parents.  Let  S 
denote  summation  for  Nj  parents.  Then,  without  any  assumption  as  to  the  type  of 
frequency,  Ni  =  S  (kxz)  =  XMjNi,  where  X  is  a  constant  such  that  \x  is  the  weight 
of  a  parent  of  fertility  x.    This  follows  at  once,  since  : 

=  S  {z),       M,  =  S  (xz)/S  (z). 

Further, 

M\  =  S{XxXxz)/W,  =  ^^^ 

_  8{(x-  M,fz  +  2Mi  (xz)  -  Mfg} 

by  the  definition  of  standard-deviation.     Hence,  finally  : 


Further 


„  _  S  {Xx(x  -  M.\)-z}  _  S{(x-  M,  +  Ml)  (a;  -  Ml  +  M,  -  W.fz} 
Hence,  multiplying  out,  we  find  after  some  reductions  : 

^^  =  ^^(^-M^;+ — m;n; —  

At  first  sight  it  might  seem  a  comparatively  easy  matter  to  avoid  weighting  parents 
with  their  fertility,  but  practically  it  is  almost  impossible.    For  example,  if  records 

*  i.e.,  if  /  be  the  fertility  of  a  parent,  each  parent  is  repeated  Xf  times,  where  A  is  a  constant. 
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are  sought  of  the  fertiHty  of  mothers  in  mankind,  the  women  will  appear  under  their 
husbands'  names,  and  the  labour  of  ascertaining  whether  two  sisters  have  been 
included  is  enormous,  when  large  numbers  are  dealt  with.  But  if  two  or  more  sisters 
have  been  included,  their  mother  has  been  weighted  with  her  fertility,  and  when  we 
seek  the  correlation  between  mother  and  daughter,  it  will  be  between  mothers  and 
daughters  when  weigiited  with  fertility.  But  a  still  more  serious  difficulty  arises 
from  the  fact  that  all  records  are  themselves  weighted  records  ;  the  same  number  are 
not  married  from  each  family,  hence  we  are  more  likely  to  find  a  member  of  a  large 
family  included  than  a  member  of  a  small.  The  large  families,  when  we  seek  a  record  of 
two  generations,  are  more  likely  to  appear  than  small  families.  Precisely  the  same  diffi- 
culty occurs  when  we  are  dealing  with  thoroughbred  horses  ;  a  mare  with  large 
fertilit}'  is  less  likely  to  have  all  her  offspring  colts,  or  all  her  progeny  sold  abroad, 
some  one  or  more  will  probably  ultimately  come  to  the  stud,  and  thus  mares  of  large 
fertility  are,  d  priori,  more  likely  to  contribute  to  our  fecundity  correlation  cards. 
We  do  not  get  over  this  difficulty  by  taking  the  mother  and  only  one  of  her  offspring. 
The  record  is  still  weighted  with  fertility.  The  practical  verification  of  this  lies 
in  the  experience  that  the  fertility  of  mothers  will  always  be  found  to  be  greater 
than  that  of  daughters,  although  the  fertility  of  the  community  may  really  be 
increasing  ;  the  weighting,  of  course,  excludes  sterility  in  the  generation  of  mothers, 
but  the  mere  exclusion  of  the  sterile  is  far  from  accounting  for  the  whole  difference. 

What  we  actually  find  from  our  records  are  M'l  and  cr'i,  but  what  we  want  for  the 
problem  of  heredity  are  Mj  and  cti.  Equations  (i.)  and  (ii.)  do  not  suffice  to  determine 
these,  because  we  cannot  evaluate  the  third  moment  S  {(x  —  MiVz].  We  can  hardly, 
even  for  a  first  approximation,  assume  it  zero,  for  the  standard-deviation,  and  there- 
fore the  individual  variation  is  large  as  compared  with  the  mean  in  the  case  of 
fertility,  i.e.,  the  distribution  is  markedly  skew. 

Turning  to  offspring  of  the  same  sex  as  the  parents,  say  :  let  be  the  mean 
fertility  of  offspring  taking  one  only  to  one  parent  for  the  number  Ni  of  parents, 
supposing  the  parents  not  weighted  with  their  fertility  ;  let  M'o  be  the  mean  in  the 
same  case  when  the  parents  are  weighted  with  their  fertility ;  and  let  W2  be  the 
mean  of  cdl  recorded  offspring  of  the  second  generation.  Let  a,,  cr'2,  a-" 2  be  the 
standard  deviations  in  the  fertility  of  the  offspring  for  the  same  three  cases,  and 
r,  r,  r"  be  the  corresponding  coefficients  of  correlation  between  fertility  in  parent 
and.  in  offspring.  It  seems  to  me  that  r  is  the  coefficient  which  actually  measures 
the  real  inheritance  of  fertility,  but  that  in  any  correlation  table  that  we  can  form 
we  shall  get  r  or  r". 

Let  y  be  the  fertility  of  any  individual  among  the  offspring,  and  x  the  fertility  of 
the  corresponding  parent ;  let  \x  as  before  be  the  weighting  of  the  parent,  and  \'x 
the  number  of  offspring  included  in  the  record,  X'  being  supposed  a  constant.* 

*  I  have  been  unable  so  far  to  find  any  Kf  nsible  coi-rcl-ition  between  size  of  family  and  number 
married  in  man,  but  the  point  is  worth  a  more  elaborate  investigation. 
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We  have  at  once  the  following  results  for  the  total  numbers  dealt  with  in  each 

CclSS  ' 

Ni  =  S  {z),  N'l  =  S  {\xz)  =  XMiNi, 

W\  =  S  (XxX'xz)  =  XX'S  (xh)  =  XX'  (crl  +  .    .    .    .  (iii.). 


Turning:  to  the  means 


'is 


M,  =  S  (2/2)/N,  (iv.). 


M'„  =  S  {\xyz)/K,  =  [S  { (x  -  Ml)  (y  -  M,)  z]  +  M^MoS  (2)]/M,Ni  =  +  r  (v.). 
M"  -  S  (KxX'xvz)/!^"  -  M  +  '^^^^  +  ^{(x-M,f(y-M.^z} 

^  "T"  ^  Ml        '  Ml  (7^  +  Ni  (erf  +  MO 

after  some  reductions.    Now  make  use  of  (ii.)  and  we  have  : 

,         S  {(^  -  Mi)-^  (O/  -  M,)  -       {X  -  Ml))  z] 
M''  =  M.,  H-  r  ^^^^  +  r  ,  V  ^  /^j  ^ 

But  for  normal  correlation  the  equation  to  the  straight  line  of  regression  is  : 

2/ -      =  -  Ml). 

Hence  for  such  correlation  the  mean  value  of  y  —  M2  for  parents  a:;  —  Mi  is  equal 

to  r  —  {x  —  Ml)  and  the  summation  term  would  vanish.     For  skew  correlation, 

Mr.  Yule  has  shown  that  the  line  just  given  is  the  line  of  closest  fit  to  the  curve  oi 
regression.  Hence  even  in  the  case  of  fertihty,  where  the  correlation  is  certainly 
skew,  the  summation  term  must  be  extremely  small,  or  even  zero.  It  follows,  there- 
fore, that  we  may  write  : 

M".=  M.  +  .-^(l  +  ^-f:^.)  (vU.), 

There  is  still  another  mean  which  ought  to  be  found,  namely,  that  of  parents,  M"i, 
when  all  their  recorded  offspring  have  been  entered  on  the  correlation  table.  We 
have  : 

M"i  =  S  {XxUxxz)!^",  =  S  [x'z)/{^,  (a-l  +  Ml}}, 
or,  after  some  reductions  : 


M,  V     '  1  +  »«/Mf 
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I  now  proceed  to  the  standard  deviations  for  the  three  cases,  and  the  additional 
case  (o-"i)  for  parents. 

al  =  S{{y-M,yz]/K,  (ix.). 

Whence,  after  some  reductions,  we  find  : 


'2    2 

tr  2  —  (Tz 


(x.). 


Now  for  a  nearly  straight  line  of  regression  : 

2/  —  M2  =  r     (a;  —  Ml)  +  77 

where  rj  is  uncorrelated  with  x  —  Mi.  It  follows  accordingly  that  S{{x  —  Mi)^7;z]  and 
S{(x  —  Mi)r7"2;}  will  both  vanish,  since  S{r})  for  an  array  and  S  (.t  —  Mj)  for  (;he 
whole  correlation  surface  will  be  zero.  Hence  the  summation  term  in  {x.)  is  either 
absolutely  zero  or  extremely  small.    We  have  accordingly  : 

'2    2    I  1      L  ^.2  /  1 


cr1  =  o-2-    l  +  r^r4-  1   (xi.). 


Before  we  proceed  to  determine  <t"2  and  (t'\  it  seems  simplest  to  find  the  coefficients 
of  correlation     r  and  r" .    We  have  : 

''=S((x-  Mi)(2/-M2)2}/(Ni(ri(r2)  (xii.). 

To  find  r  we  have  : 

r  =  ^{\xz{x  -  M'l)  [y  -  W,)]/ (N\o-\a-',). 

Now 

2/-M2  =  ^(x-M,)  +  r7, 
where  yj  is  sensibly  un-correlated  with  x  —  Mi.    Hence  : 

NVio-V  =  Sjx^rz  {x  -  M'l)  (r^'  {x  -  M^  +  M2  -  M',,  +  7;)  j- 
Expanding,  the  summations  with  t]  vanjsh,  and 
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M,-M'.-r^M,=  -^(^  +  M,)by(v.) 

=  -  r^  M',  by  (i.) 

N'iO-',cr',r'  =  SfXxz  {x  -  M',)      {x  -  M'^)} 
=  — '  S{Xx^(x~M'i)2} 

=  JN  lO-  1. 


But 
thus  : 


Thus  we  deduce 


_   


or 


ra'^lcy'x  ~  rar.Ja-i   (xiii.). 


This  result  has  the  simple  interpretation  that  while  the  coefficient  of  correlation  is 
changed,  the  coefficient  of  regression  is  unchanged  hy  iveighting  fertility,  or  by 
reproductive  selection. 

This  important  conclusion  is  only  an  illustration  of  a  very  interesting  theorem, 
which  has  been  referred  to  in  another  memoir*  and  will  be  proved  generally  in  a 
memoir  on  directed  selection,  written  but  not  yet  published,  ^.e.,  that  in  a  wide  range 
of  cases  selection,  whether  random  or  directed  (natural  and  artificial)  changes 
correlation  but  not  regTession. 

Before  proceeding  further  a  general  remark  will  enable  us  to  considerably  simplify 
the  otherwise  lengthy  algebra.  Namely,  the  relation  of  M"i,  M'^,  o-",,  cr"2,  r"  to  M',, 
M'._,,  cr'i,  (x'o,  r'  is  precisely  the  same  as  that  of  M'l,  M'2,  (t\,  r  themselves  to  Mi, 
Mo,  cTj,  cr.j,  r.  Consequently  an  interchange  of  symbols  in  results  already  found  will 
lead  us  to  the  remaining  formulae  needful. 

As  an  illustration  of  this,  let  us  verify  the  result  we  have  found  for  W^.  By  an 
interchange  in  (v.)  : 


W\  =  W,  -h  r' 


hence  using  (v.),  (i.)  and  (xiii.),  we  find  : 


-  M,  +    [^  +  1  +  ^/M?; ' 

*  "  Contributions  to  the  Theory  of  Evolution.— V.  On  the  Reconstruction  of  Stature,"  '  Phil.  Trans.,' 
A,  vol.  192,  p.  177. 
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exactly  the  resiilt  reached  by  a  longer  process  in  (vii.).  Similarly  (viii.)  may  be 
deduced  from  (i.).     Applying  this  to  find  r"  we  have  from  (xiii.)  : 

r" a-" 2/0-" I  —  rVVo-'u  and  therefore  =  ra-^/a-i    ....  (xiv.), 

a  result  which  again  extends  the  constancy  of  the  regression  coefficient  under  the 
action  of  reproductive  selection. 
Next  from  (xi.)  : 


1 

0 

\  or 


0-0=0-2  i  1  +  ^ 


or  using  (xiii.)  and  rearranging  : 

o-l  =  a-:-{l  +r^(-^-  1)}  (XV.). 

Again  by  interchanges  in  (ii.) : 

"2      ■     /,       -1  \  ,  S{(^-M-,)V}  ,     .  . 

Here  z  stands  for  \xz,  and  we  should  obtain  a  fourtli  moment  of  the  original 
system  of  unweighted  parents  by  substitution.  But  it  is  practically  impossible  to 
obtain  a  correlation  table  for  such  a  system.  Thus  it  is  better  to  allow  the  sum- 
mation term  to  stand  as  it  is,  where  it  represents  the  third  moment  of  a  system 
of  parents,  weighted  for  fertility  owing  to  the  nature  of  the  record,  but  not  weighted 
with  all  their  recorded  offspring,  (xvi.)  is  then  a  relation  between  the  standard- 
deviations  of  parents  weighted  solely  by  forming  a  record  and  weighted  both  by  this 
and  by  their  offspring. 

Equations  (i.)  to  (xvi.)  contain  the  chief  theoretical  relations  of  our  subject,*  and 
I  shall  consider  some  points  with  regard  to  them  in  the  following  section. 


(3.)  (a.)  If  we  wish  to  ascertain  whether  fertility  is  inherited,  we  have  to  discover 
whether  r  is  or  is  not  zero.  Now  by  (xiv.)  r  vanishes  with  both  r  and  and  accord- 
ingly either  of  these  will  suffice  to  answer  the  problem.  Still  better,  we  may  ascertain 
the  coefficient  of  regression,  and  then  whether  our  statistics  weight  for  progeny  or 
not  we  shall  obtain  the  same  value.  If  there  be  no  secular  change  taking  place  in  the 
population,  due  to  something  else  than  reproductive  selection,  we  should  expect, 
provided  the  Law  of  Ancestral  Heredity  holds  for  fertility,  that  the  regression  will 
be  near  "3  for  parent  and  offspring.! 

*  Two  of  these  formulae,  (v.)  and  (xi.),  were  given,  but  in  a  less  precisely  defined  manner,  in  my 
"  Note  on  Reproductive  Selection  "  of  1890,  '  Roy.  Soc.  Proc.,'  vol.  59,  p.  .303. 
t  See  "Law  of  Ancestral  Heredity,"  '  Roy.  Soc.  Proc.,'  vol.  02,  p.  397. 
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(6.)  If  no  reproductive  selection  exists,  i.e.,  if  fertility  be  not  inherited,  then  r  =  0, 
and 

a",  =  a',  =  o-„         =  M'j  =  M"2, 

or,  however  we  form  a  record  of  offspring-,  the  mean  value  and  variability  of  their 
fertility  ought  not  to  be  changed.  We  shall  see  later  that  this  is  very  far  from  the 
truth,  and  that  these  values  are  in  whole  or  part  sensibly  affected  by  the  manner  in 
which  the  record  is  formed. 

(c.)  Although  there  be  no  reproductive  selection,  Mj,  M'l,  and  M"i  will  not  all  be 
equal,  it  is  impossible  that  they  should  be.  Further.,  cr,,  (r\  and  cr"i  need  not  be 
equal ;  their  degree  of  sensible  divergence  will  depend  on  the  nature  of  the  primitive 
frequency  distribution  for  parents. 

(d.)  Tf  fertility  be  inherited,  or  reproductive  selection  be  an  actual  factor  of 
evolution,  then  we  see,  by  comparing  (v.)  with  (i.)  and  (vii.)  with  (viii.),  that  the 
mean  fertility  of  mothers  will  always  be  apparently  greater  than  the  mean  fertility 
of  daughters.  This  follows,  since  r  is  always  less  than  unity,  and  if  the  race  be  not 
subjected  to  secular  evolution,  other  than  that  due  to  reproductive  selection,  o-g 
cannot  differ  very  widely  from  o-i."'' 

(e.)  An  argument  from  means,  as  to  whether  fertility  is  inherited  or  not,  is  very 
likely  to  be  misleading.  We  may  choose  two  groups  from  the  record  for  comparison, 
neglecting  the  fact  that  their  frequency  in  the  record  is  not  necessarily  that  of  their 
frequency  in  the  general  population.  Thus,  if  one  person,  say,  in  four  were  married, 
a  marriage  record  of  the  community  might  exhibit  the  proper  frequency  of  families 
of  four,  but  it  would  not  do  so  of  families  of  one.  The  sort  of  fallacious  arguments 
we  have  to  be  prepared  for  are,  for  example  : 

(i.)  That  the  fertility  of  the  community  is  diminishing,  because  M'j  is  less 
than  M'l. 

(ii.)  That  the  fertility  of  the  community  is  increasing,  because  M'2  might  be  >  M, 
or  M"o  be  >  MV 

(iii.)  That  fertility  is  not  inherited,  because,  owing  to  natural  selection,  or  other 
factor  of  evolution,  one  or  other  of  these  means  for  offspring  is  sensibly  equal 
to  one  or  other  of  these  means  for  parents. 

Owing  to  the  extreme  difficulty  of  insuring  that  the  method  of  extracting  the 
record  really  gives  us  definitely  M'2,  say,  and  not  M"2  (or  W2  in  part),  I  have 
discarded  all  use  of  the  mean  values  in  attempting  to  ascertain  whether  fertility  is 
inherited.    The  following  result,  however,  is  tempting,  and  might  possibly  be  made 

*  A  difference  between  and  ac,  would  mark  natural  selection,  sexaal  selection,  or  some  other  factor 
of  secular  evolution  at  work ;  of  secular,  not  periodic,  evolution,  as  parents  and  offspring  must  have 
reached  the  same  adnlt  stnge  to  have  had  their  fertility  measured; 


CONTRIBUTIONS  TO  THE  THEORY  OF  EVOLUTION". 


269 


use  of  in  direct  experiments  on  breeding  insects,  where  a  record  could  be  kept  ad 
hoc.    It  follows  at  once  from  (i.),  (v.),  (vii.)  and  (viii.) : 

M'2  -  Ma       W\  -  M',  (T2  re  •     X    r  •  I     -  \ 

—  — r  =  7777  -yf  =  r  —  =  coemcient  01  resrression     .    .  (xvn.). 

M'l  -  Ml      M"i  -  M'l        0-1  ^  ^  ' 

It  is  the  second  ratio  which,  T  think,  might  with  profit  be  experimentally 
evaluated. 

(yi)  Since  the  mean  fertility  of  daughters  loaded  with  the  fertility  of  their 
mothers  is  the  fertility  of  the  next  generation,  and  we  see  that  this  is  always  greater 
than  M2,  if  r  be  not  zero,  it  follows  that  the  inheritance  of  fertility  marks  a  progres- 
sive change.  The  only  means  of  counteracting  its  influence  would  be  the  reduction 
of  M2  to  or  below  Mj  by  the  action  of  other  equally  potent  factors  of  evolution.  For 
the  existence  of  such  factors  in  man  I  shall  later  give  evidence. 

(4.)  Proposition  III. — To  extend  the  results  obtained  for  fertility  to  the  problem  of 
fecundity. 

While  the  fecundity  of  an  individual  can  often,  at  any  rate  approximately,  be 
measured,  the  fertility  is  not  ascertainable.  Thus  we  can  ascertain  the  number  of 
occasions  on  which  a  brood  mare  has  gone  to  the  stallion  and  the  number  of  foals  she 
has  produced,  but  her  fertility,  the  produce  she  might  have  had,  if  she  had 
throughout  her  whole  career  had  every  facility  for  breeding,  is  unknown  to  us.  But 
if  we  proceed  to  form  tables  for  the  inheritance  of  fecundity,  we  are  met  by  precisely 
the  same  difficulties  as  in  the  case  of  fertility.  The  more  fertile  individuals  are 
priori  more  likely  to  appear  in  the  record,  and  will  be  likely  to  be  weighted  again 
with  their  fertility  when  we  come  to  deal  with  their  offspring.'"' 

Now  it  is  certain  that  fertility  must  be  correlated  with  fecundity  ;  or,  if  x  now 
represents  the  fecundity  and  f  the  fertility,  we  shall  have  for  the  mean  fertility  for  a 
given  fecundity  x  an  expression  of  the  form  Xo  +  ^i*',  always  supposing  the  regression 
to  be  sensibly  linear.  But  the  fertility  must  vanish  with  the  fecundity,  hence  Xq  =  0, 
and  Xi  is  really  the  ratio  of  mean  fertility  to  mean  fecundity.  Thus  we  may  write  for 
the  fertility  f 

f=Kx  +  C, 

where  ^  may  vary  widely,  but  it  is  not  correlated  with  x. 

If  now  all  the  symbols  we  have  used  with  regard  to  fertility  in  Section  (2)  be  inter- 
preted as  referring  to  fecundity,  we  must  weight  with  a  factor  X f  instead  of  a  factor 
Xx,  or  with  a  factor  W^x  +  X^.  So  long  as  this  factor  is  linear,  absolutely  no  change 
can  be  made  in  the  results,  for,  C  being  uncorrelated  with  x,  all  summations  including 

*  In  the  case  of  sires  especially,  if  we  are  dealing  with  thoroughbred  horses,  their  comparative 
fewness  at  each  period  renders  it  quite  impossible  to  deal  with  one  offspring  of  each  parent  only, 
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S  (C)  vanish.  Thus  all  the  values  given  for  M.\,  M'2,  o-\,  a.nd  cr'2  remain  the  same,  if 
their  results  be  interpreted  in  the  sense  of  fecundity  and  not  fertility.  If  p  be  the 
correlation  between  fecundity  and  fertility,  and  cti,  0-3  the  standard  deviations  of 
these  quantities,  then  Xi  =  joo-g/cri ;  but  we  have  seen  that  it  is  also  the  ratio  of  mean 
fertility  to  mean  fecundity.  It  follows  accordingly  that  p  is  the  ratio  of  the  coefficient 
of  variation  in  fecundity  to  the  coefficient  of  variation  in  fertility.  If  we  may  judge 
by  the  cases  of  man  and  horse,  so  far  as  I  know  the  only  cases  in  which  fertility  and 
fecundity  have  yet  been  examined,  a  coefficient  of  variation  in  fecundity  amounts  to 
about  30  per  cent.,  while  one  in  fertility  is  something  like  50  per  cent.  Thus  the 
correlation  of  fertility  with  fecundity  would  be  about  '6.  We  should  expect  it  to 
have  a  high  value,  perhaps  even  a  higher  value  than  this.  In  the  case  of  thorough- 
bred horses,  p  will  be  the  correlation  between  fecundity  and  apjjarent  fertility.  By 
direct  investigation  in  the  case  of  1000  brood  mares  I  find  its  value  to  be  '5152. 

Passing  now  to  the  correlations  r,  r',  r",  I  observe  that  the  proof  given  for  fertility 
is  valid  with  but  few  modifications,  if  these  be  fecundity  correlations  (see  p.  266),  for 
the  proof  involves  no  expansion  of  the  factor  [X^x  +  Hence  we  conclude  that  the 
regression  coefficient  for  the  inheritance  of  fecundity  will  not  be  modified  by  the 
nature  of  the  record  or  the  weighting  of  individuals  with  their  fertility. 

When  we  come  to  the  last  series  of  constants,  M"i,  M"2,  cr",,  a"2,  we  find  that  these 
will  be  modified,  owing  to  the  presence  of  the  square  factor  {\iX  +  Cf,  although 
C  is  not  correlated  with  x.  The  term  ^  i>ow  comes  in,  and  S  (^^)  will  give  the 
standard-deviation  of  an  array  of  fertilities  corresponding  to  a  given  fecundity,  i.e., 
S  (^^)  =  0-3  (1  —  p")  X  number  in  the  array.* 

I  find  after  some  reductions  that  M"2  and  M"i  are  given  by 

M'=  =  M»  +  '-f  (^  +  rT^)  •  •  •  • 

M".  =  M.  +  I  (1  + 

the  correlation  of  fertility  and  fecundity  being  now  introduced  into  the  results. 
Clearly  the  result  (xvii.) 

M",  —  M'o 

 —  =  coefficient  of  regression  (xx.) 

still  remains  true. 

For  the  remaining  two  constants  cr''^  and  cr"i,  I  find,  after  some  rather  long 
analysis  in  the  second  case,  which  it  seems  unnecessary  to  reproduce,t 


*  Shoxild  the  regression  not  be  linear,  0-3  ■/(!  —  />-)  is  the  mean  of  the  standard-deviations  of  the  arrays, 
t  In  the  course  of  the  work  the  squared  standard-deviation  of  a  fertility  array  is  assumed  to  be  the 
s^me  for  all  arrays  =  <7^1  —  p"),  and      is  given  its  value  jxr-ila^.    See,  however,  the  previous  footnote. 
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^"l  =  ai\l+7^('^-l)\  (xxL), 


.       7^T\   ,  <^t    ,   S  {(X  -  M\fZ'}  .. 


."2 


and  y  is  the  factor  ,„  or  as  we  can  write  it 


If  p  be  unity  or  near  unity,  i.e.,  fecundity  very  closely  correlated  with  fertility, 
y  =  ],  the  second  term  vanishes  and  (xxii.)  becomes  identical  with  the  corresponding 
fertility  formula  (xvi.),  just  as  (xxi.)  is  already  identical  with  (xv,). 

Thus  we  see  that  the  whole  series  of  fecundity  relations  are  strikingly  like  those 
for  fertility,  except  that  in  certain  of  them — those  for  M"i,  M'^j  o""i  and  cr"2 — the 
correlation  p  of  fertility  and  fecundity  is  introduced.  If  p  be  considerable,  all  the 
remarks  we  have  made  on  the  fertility  formulae  may,  mutatis  mutandis,  be  applied  to 
the  measurement  of  fecundity. 


(5.)  Proposition  IV. — To  deduce  formidcB  for  finding  the  correlation  between  any 
grades  of  kindred  from  the  means  of  arrays  into  which  the  kindred  may  be  grouped. 

This  problem  is  of  very  great  practical  importance.  In  the  case  of  Man,  families 
are  so  small  that  there  is  com[)aratively  small  difficulty  in  forming  all  the  possible 
pairs  of  brethren,  say,  for  any  family  ;  but  when  we  come  to  animals  or  insects  where 
the  fertility  may  be  extremely  large,  it  is  practically  impossible  to  form  a  correlation 
table  involving  50,000  to  100,000  entries.*  One  thoroughbred  sire  may  have  50  to  80 
daughters,  and  thus  give  us  roughly  1200  to  3200  pairs  of  sisters  to  be  entered  in 
a  correlation  table.  Still  higher  results  occur  in  the  case  of  aunts  and  nieces.  It 
may  be  asked  why  we  do  not  content  ourselves  with  one  or  two  pairs  from  each 
parent ;  the  answer  is  simple  :  we  have  not  {e.g.,  in  the  case  of  thoroughbred  animals, 
pedigree  moths,  &c.)  a  great  number  of  sires,  and  the  sire  with  50  oflPspring  cannot, 
for  accuracy  of  result,  be  put  on  the  same  footing  as  the  sire  with  only  2  to  4.  Our 
process  is  really  an  indirect  weighting  of  our  results. 

(A.)  To  find  the  coefficient  of  correlation  between  brethren  from  the  means  of 
the  arrays. 

Let  X  be  the  measure  of  any  character  or  organ  in  one  brother  (sister),  and  x  that 
of  a  second  brother  (sister)  :  let  m  be  the  mean  of  one  set  of  brothers,  and  m'  of  the 


*  Even  with  the  reduction  in  labour,  introduced  by  this  proposition  and  by  the  use  of  mechanical 
calculators,  Mr.  Leslie  Beamlet-Mooke  and  I  took  practically  a  week,  of  eight-hour  days,  to  deduce 
two  coefficients  of  correlation,  after  the  means  of  the  aiTays  had  already  been  found. 
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second  set.  Let  n  be  the  number  of  brothers  in  an  array,  and  therefore  (n  —  1 )  the 
number  of  pairs  of  brothers  in  the  array.  Let  cr  and  cr'  be  the  standard  deviations 
of  the  two  sets  of  brothers,  and  r  the  coefficient  of  correlation  between  brothers  for 
the  organ  in  question.  Let  S  denote  a  summation  with  regard  to  all  pairs  of  brothers 
in  the  community,  and  %  with  regard  to  all  brothers  in  an  array.  Let  N  be  the 
total  number  of  brothers  in  the  community.  Then  if  we  selected  our  pairs  of  brothers 
for  tabulation  at  random  {e.g.,  not  by  seniority  or  other  character),  we  should  find 
m  =  m  and  cr'  =  cr.    Farther,  by  definition  of  correlation 

Nro-o-'  =S{x-m)  (x'  -  m)  =  St  {x  -  U  +  M  -  m)  {x'  _  M'  +  M'  -  m'), 

where  M  and  M'  are  the  means  of  the  two  sets  of  brothers  in  any  array  and  are 
clearly  equal. 

Further,  X  [x  —  M)  =  S  {x  —  M')  =  0,  when  summed  for  an  array,  and 
%{x  —  M)  [x  —  M')  =  0,  for  there  is  no  correlation  within  the  array  when  the 
deviations  are  measured  from  the  mean  of  the  array.    Hence  : 

Nmo-'  =  S  {In  (n  _  l)  (M  -  m)  (M'  -  m')}, 

or 

Nro-'  =  S  {in  (n  -  1)  M^}  -  2mS  {^n  (n  -  1)  M]  +  m^N  ; 

but 

S  {|n  (n  -  1)M}  =  Nm. 

Thus,  finally, 

S  {in(n  -  1)  M^l/N  - 
^=   ^   

This  can  be  written 

r  =  ai/a-''  

where  (t„  is  the  standard  deviation  of  the  arrays  concentrated  into  their  means  and 
loaded  with  their  sizes  ;  cr  is  the  standard  deviation  of  all  brethren  loaded  with  the 
number  of  times  they  are  counted  as  brethren  ;  m  is  the  mean  of  all  the  offspring 
loaded  with  the  number  of  times  they  are  counted  as  brethren. 

Let  cTo  be  the*  standard  deviation  of  ofispring,  and  p  the  correlation  between  parent 
and  offspring,  then  the  standard  deviation  of  an  array  of  offspring,  if  correlation  be 
sensibly  linear,*  will  be  o-q        —  p^).    We  have,  further, 

m  =  S  (x)  =  SS  (x  -  M  +  M)  =  S  {in  (n  -  1)  M], 
No--  =  S  (x  -  m)2  =      (cc  -  M  +  M  -       =  S  { S  (x  -  Mf  +  in  (n  -  1 )  (M  -  mf] . 

But 

t{x-  Uf  =  in  (n  -  1)  o-^  (1  -  p-). 


(xxiii.). 
(xxiv, ) 


*  See,  however,  the  first  footnote  p.  270. 
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Thus: 

Ncr^  =  Ncr-^  (1  -  p'}  +  No-L 

or  : 

0-^=0-^1  -p^)  +  (rf.  (xxv.), 

and  r  may  be  written  : 

"'■  =  <^(i   <™-)- 

Here  cr„  can  be  found  from  the  arrays,  and  ctq  and  p  will  in  many  cases  have  been 
previously  ascertained. 

(B.)  To  find  the  correlation  between  "uncles"  and  "nepheivs"  {"aunts"  and 
"  nieces  ")  from  the  means  of  the  corresponding  arrays. 

Let  Ui  be  the  number  of  uncles  in  an  array,  n^,  be  the  number  of  nephews  in  the 
associated  array,  so  that  ^1^2  is  the  number  of  pairs  of  uncles  and  nephews  provided 
by  the  associated  arrays.  Let  N  =  S  {yiin^  be  the  total  number  of  pairs  of  uncles 
and  nephews  in  the  community  under  considei-ation.  Let  x  be  the  measure  of  the 
organ  or  character  in  the  uncle,  x  in  the  nephew.  Let  M  and  M'  be  the  means  of 
two  associated  arrays  of  uncles  and  nephews  respectively.  Let  m  and  m'  be  the 
means  of  all  uncles  weighted  with  their  nephews  and  all  nephews  weighted  with 
their  uncles  respectively,  and  let  <r,  <t'  be  the  corresponding  standard  deviations 
under  the  same  circumstances  ;  r  the  correlation  of  uncle  and  nephew.    Then  : 

NrVo-'  =  S  (ir  -  m)  (x'  -  m')  =  -  M  +  M  -  m)  {x  -  M'  +  M'  -  m'). 

Now  '%{x  —  M)  —  '%{x'  —  M')  =  0,  and  within  the  arrays  there  is  no  association 
of  individual  uncles  with  individual  nephews,  i.e.,  S  (x  —  M)  (x  —  M')  =  0.    Thus  : 

Nr'o-o-'  =  S  {n,n^  (M  -  m)  (M'  ~  m')}  =  S  {n.nMW)  -  Nmw', 

since 

m  =  S  {n.nM)/^,       m'  =  S  {uynM.')!^. 

Thus : 

S  (%,?i,MM')/N  -  rwm' 
=  '-J^'  (^^^^i-)- 

.    If  (Ta  and  o-'a  be  the  standard  deviations  of  the  means  of  the  arrays  of  uncles  and 

nephews  and  R  the  correlation  of  these  means,  the  numerator  is  clearly  Ra-„a-'„. 
Thus : 

r  =  R  -=^^     (xx  viii. ). 


Here  the  numerator  as  a  whole  or  in  parts  is  easily  found  from  the  means  of  the 
VOL.  cxcii. — A.  2  N 
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arrays.  If  ctq  and  cr'o  be  the  means  of  unloaded  uncles  and  nephews,  we  note  that 
they  are  arrays  owing  to  common  parentage,  and  hence  their  array  standard 
deviations"'  will  be  o-q  \/l  —  and  cr'o  ^  1  —  /o^  p  being  the  standard  deviation  of 
parent  and  offspring.    As  before  we  find  : 


o"^  =  o-a  +  o-o,(l  —  p) 
o-   =  o-  a.-\-  cr  o{\  —  p  ) 


(xxviii.). 


If,  as  will  probably  be  the  case,  there  be  no  secular  change  between  uncles  and 

nephews,  then  cr  =  cr',  a.^  =  o-'„,  o-q  =  o-'o,  and  accordingly  r  =  Rcr^V*^^  5  whence,  using 
(xxiv.),  we  have  : 


r'  =  rxRx^-i  (xxix.). 

If  we  could  assume  cr„  =  cr„  and  cr  =  cr,  this  result  would  reduce  to  the  very 
simple  form : 

r'  =  r  X  E. 

Now  the  assumption  ctq  =  otq  is,  I  think,  legitimate,  for  the  distribution  for  an 
unloaded  array  of  nephews  or  uncles  should  be  sensibly  that  of  an  array  of  brethren. 
But  the  equality  of  cr„  and  cr„,  which  would  now  involve  that  of  cr  and  cr,  is  a  much 

more  doubtful  point.  cr„  and  o-„  mark  indeed  quite  different  systems  of  loading. 
Both,  it  is  true,  are  of  the  form 

S  {nn'W)  /  N  -  -I  S  {nn'M.)  /  N}^ 

but  in  the  case  of  brethren  n  —  ^  {n  —  1)  ov  7t  has  perfect  correlation  with  n,  while 
in  the  case  of  uncles  and  nephews  n  is  only  imperfectly  correlated  with  n.  The 
intensity  of  this  correlation  depends  upon  the  correlation  between  the  sizes  of  arrays 
of  uncles  and  nephews,  a  quantity  which  may  be  very  small,  or  not,  according  to  the 
nature  of  the  record.    Hence  it  appears  necessary  in  applying  the  method  to  make 

some  attempt  to  appreciate  the  value  of  cr„  as  well  as  cr„.  If  this  be  done  II  can  be 
found  from  (xxix.),  if  not  directly.  This  value  of  B  is  not  without  importance  for 
the  inheritance  of  characters  latent  in  one  or  other  sex. 

We  have  thus  reduced  the  correlations  of  individuals  to  a  calculation  of  the  corre- 
lation of  arrays. 

(6,)  Proposition  V. — To  Jind  a  measure  of  the  effect  of  mingling  uncorrelated 
material  ivitJi  correlated  material. 

The  importance  of  this  investigation  lies  in  the  fact  that  death,  restraint,  or  other 

*  Or,  again,  tlie  means  of  the  standard -deviations  of  the  arrays. 
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circumstances,  completely  screen,  in  a  certain  number  of  cases,  both  the  potential 
fertility  and  the  real  fecundity  of  man.  Precisely  similar  circumstances,  which  will  be 
considered  more  at  length  later,  hinder  our  obtaining  in  horses  a  true  measure  of 
fecundity  for  all  cases.  We  are  thus  really  dealing  with  a  mixture  of  correlated  and 
apparently  uncorrelated  material.  In  what  manner  does  the  influence  of  this 
mixture  effect  our  results  ? 

Let  a  group  N  consist  of  +  ^2  +  ^3  +  ^4  pairs  of  individuals.  Of  these,  in  the 
case  of  Wi  pairs,  both  individuals  have  the  true  value  of  the  character  under  investi- 
gation recorded  ;  in  the  case  of  Uo  pairs,  neither  have  the  true  value  recorded ;  in  the 
case  of  tig  pairs,  it  is  the  first  individual  of  the  pair  which  has  a  true  recorded  value, 
and  the  second  an  apparent  or  fictitious  value ;  lastly,  in  cases,  let  the  fictitious 
value  be  in  the  first  and  the  real  value  in  the  second  individual  of  the  pair.  Then 
there  will  be  no  correlation  between  individuals  in  the  groups  no,  n^,  ni.  Let  r  be 
the  correlation  in  the  group  rii  and  E,  that  observed  in  the  whole  group  of 
N  =  ni  +  W2  +  +  Let  x  be  the  measure  of  a  character  in  the  first,  x  in  the 
second  individual.  Let  M  and  M'  be  the  means  of  the  total  groups  of  the  two 
individuals  and  %,  %'  their  standard  deviations.  In  group  let  the  corresponding 
quantities  be  m-^,  m\,  o-i,  cr'i,  and  a  similar  notation  hold  for  the  other  sub-groups. 
Then  Wi  =  and  0-1  =  0-3;  m^  —  and  0-2  =  0-4 ;  while  m\  =  m\  and  a\  =  <t\  ; 
m'2  =  m'3,  o-'a  =  o-'g.  .  '  . 

We  have  at  once  : 

m-i  +       +      +  +  n.^  +      ■+  ' 

while 

jyj-'  _  (?h  +  n^)  ni\  +  {n,^  +  m\ 
+  ^^2  +  %  +  % 

Further : 

in,  +     +  n3  +      XS'R  =  S  (a;  -  M)  {x'  -  M'), 
by  the  usual  properties  of  product  moments 

=  Wjo-io-'ir  +     (mi  -  M)  (m'l  -  M')  +     [m^  —  M)  (w  2  -  M') 
+  723  K  -  M)  (m'3  -  M')  +     {m,  -  M)  {m\  -  M') 

=  ni-o-io-V'  +  nimini'i  +  n-^m^mz  +  n^rn^niz  +  niniim'i 

—  M (njm'i  +  n^m'o  +  n3m'3  -f  n^mi)  —  M'  {n^mi  +  n^m^  +  Wgrna  +  w^wi^) 

-f  MM'  (Wi+  72^  +  ^3+^4) 

=  nio-io-'ir  -f  n{m{m\  +  n^m^m^  +  ihrn^mr,  +  nimim\  —  MM'  {n^  -\-  n^). 

Substituting  the  values  of  M  and  M'  and  using  the  relations  between  the  m's,  we 
find  after  some  reductions  : 

2  N  2 
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Let  —  of  the  N  first  individuals  and  —  of  the  N  second  individuals  have  fictitious 
values,  then  ^  N  and  ^         N  will  have  their  true  values.    If,  now,  there  is  no 

p  q 

correlation  between  the  fictitious  values  in  the  two  cases,  we  have  at  once  : 

m  m  Pi  .  VI 

From  this  it  follows  at  once  that 

or  the  second  term  in  (xxx.)  vanishes.    Thus  : 

Thus  R  vanishes  with  r,  and  no  spurious  correlation  could  arise  from  the  existence 
of  fictitious  values  distributed  at  random  through  the  correlation  table.  This  result 
might,  indeed,  (as  it  often  is  tacitly)  be  assumed  by  some,  but  it  seems  very  desirable  to 
have  a  definite  proof. 

It  remains  to  consider  S  and        We  have  : 

=  (^'i  +  n^)  o-]  +       +  n^)  al 

+  (^'^1  +  W3)  ra\  +  («2  +  nj)mi  —  {n^  +  n2  +  ih  +  nj)  W, 


or 


=  (1  -  7)   +      +  (1  -  7)7"^"^'  ~  "^^^^ 

=  (r^  +  y(crl-crl)  +  (l-  ^)y(.H,-m,)^     .    ...  (xxxL). 

Similarly  : 

t'^  =  o-'\  +  Y  wi  -  ct'\)  +  (1  -  7)  7  (^'1  -  '^'2)'  •  •  (^^^"O- 


Now  if  the  introduction  of  the  fictitious  values  consisted  of  anything  of  the 
nature  of  a  wrong  pairing  of  certain  individuals,  we  should  simply  have  o-j  =  0-2, 
cr'i=  cr'2,  mi  =  W2,  wi'i  —  m'2  and,  accordingly,  2  =  cr,,  and     =  cr'i. 
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In  any  case,  if  the  percentage  of  fictitious  values  be  not  large,  the  second  and  third 
terms  are  of  the  second  order  of  small  quantities,  since  —  and       are  small.  The 

i  2J  q 

maximum  value  of  the  third  term  cannot  be  greater  than  ^  (wi  —  w/o)",  and  this 
will  be  relatively  small  in  the  cases  to  v/hich  we  shall  apply  it. 

For  example,  no  great  changes  are  made  in  o-,  when  we  vary  the  amount  of 
fictitious  cases  introduced  into  our  fertility  tables,  and  m^.  do,  however,  change. 
Thus  (Ti  =  0-2  =  3  approximately,  and  the  range      —      =  1  2.    Hence  : 

^2  =  9  +  i  (]  ■2)\  at  a  maximum,  =  9-36, 

or, 

t  =  3-06. 

Thus  in  this  extreme  case  there  is  only  2  per  cent,  change  in  the  value  of  %.  In 
such  cases  accordingly  we  may  take  for  rough  approximations  S  =  cr  and  %  =  cr'.  This 
leads  us  to  : 

Il=:^r  (xxxii.). 

Or,  the  reduction  of  correlation,  due  to  the  introduction  of  fictitious  values,  is 
obtained  by  using  as  a  factor  the  ratio  of  actual  correlated  pairs  of  individuals  to  the 
total  number  of  pairs  tabulated. 

This  result  will  be  of  considerable  service  when  we  come  to  deal  with  the  fecundity 
of  thoroughbred  racehorses. 

(7.)  Proposition  VI. — To  obtain  a  measure  of  the  spurious  correlation  apparently 
existing  betiveen  two  organs,  ivhen  a  mixture  is  made  of  heterogeneous  materials. 

Let  X  and  x'  be  measures  of  the  two  organs,  and  let  there  be  N  pairs  of  organs 
formed  by  i  heterogeneous  groups  containing  n^,  n.2,  %  .  .  .  pairs  with  means  m^,  m'l, 
W2,  m'2,  m^,  m\  .  .  .  ,  &c.,  standard  deviations  o-j,  (t\,  cto,  cr'2,  0-3,  (t\  .  .  .  ,  &c.,  and 
correlations  r^,  r.^,  r^  .  .  .  ,  &c.  Let  Mi  M'  be  the  means  of  the  whole  heterogeneous 
community,      ^'  the  standard  deviations,  and  R  the  correlation.    Then  : 

RXS'N  =  S  (Wr)  +  S  {n  (m  -  M)  (m'  -  M')], 

where  S  denotes  a  summation  with  regard  to  all  i  groups.  Now  if  there  were  no 
correlation  at  all  between  the  organs  in  any  one  of  the  i  groups,  II  for  the  hetero- 
geneous mixture  would  still  not  be  zero  so  long  as  the  second  summation  did  not 
vanish.  This,  then,  is  a  measure  of  the  spurious  correlation  produced  by  making 
a  mixture  of  uncorrelated  materials. 

Now  S  [n  [m  ■-  M)  (m'  —  M')j,  remembering  the  values  of  M  and  M'  may  be 
written : 
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s 


I       {^P  -  ^a)  i^'p  -       ]  (xxxiv. ) 


where  the  summation  S  now  refers  to  every  possible  pair     and  q  of  the  r  groups. 

Now  it  is  very  unHkely,  unless  i  be  very  large  and  the  numbers  nj,  n.^,  .  .  .  be 
taken  at  random,  that  this  expression  will  vanish.  Suppose  even  that  the  means 
of  our  heterogeneous  groups  were  uncorrelated,  i.e.,  S  (m  —  M)  (m'  —  M')  =  0,  it  is 
unlikely  that  S  {w  (m  —  M)  (m'  —  M')}  will  also  be  zero,  when  n  is  taken  at  random. 
With  a  comparatively  few  groups,  with  numbers  taken  at  random,  it  is  extremely 
improbable  that  the  principal  axes  of  the  i  points  loaded  with  Wi,  n^,  ih  .  .  .  will 
exactly  coincide  with  the  directions  of  the  axes  of  x  and  x'. 

We  are  thus  forced  to  the  conclusion  that  a  mixture  of  heterogeneous  groups, 
each  of  which  exhibits  in  itself  no  organic  correlation,  will  exhibit  a  greater  or  less 
amount  of  correlation.  This  correlation  may  properly  be  called  spurious,  yet  as  it 
is  almost  impossible  to  guarantee  the  absolute  homogeneity  of  any  community,  our 
results  for  correlation  are  always  liable  to  an  error,  the  amount  of  which  cannot  be 
foretold.  To  those  who  persist  in  looking  upon  all  correlation  as  cause  and  effect, 
the  fact  that  correlation  can  be  produced  between  two  quite  uncorrelated  characters 
A  and  B  by  taking  an  artificial  mixture  of  two  closely  allied  races,  must  come  rather 
as  a  shock. 

The  better  to  illustrate  this,  I  take  some  data  recently  deduced  by  Miss  C.  D. 
Fawcett.  She  finds  for  806  male  skulls,  from  the  Paris  Catacombs,  the  correlation 
for  length  and  breadth  '0869  i  '0236,  and  for  340  female  skulls,  from  the  same  locality, 
—  '0424  i  '0365.  The  existence  of  the  negative  sign  and  the  comparative  smallness 
of  the  correlation,  as  compared  with  the  probable  errors,  might  lead  us  to  assert  the 
correlation  between  the  length  and  breadth  of  French  skulls  to  be  sensibly  zero. 

If  now  the  two  sexes  be  mixed,  the  heterogeneous  group  has  for  correlation 
•1968  ±  '0192,  a  value  which  cannot  possibly  be  considered  zero.  Thus  the  mixture 
exhibits  a  large  spurious  correlation. 

Whether  any  given  mixture  increases  or  reduces  the  correlation  will  depend 
entirely  on  the  signs  of  the  differences  of  the  means  of  the  sub-groups.  But  the 
danger  of  heterogeneity  for  the  problem  of  correlation  will  have  been  made  manifest. 
If  the  value  of  II  for  any  mixture,  whose  components  are  known,  is  to  be  calculated, 
then  we  have  only  to  note  that : 

_  S(na^      S (n^n^  (m^  -  mj)  ^  _  S (na'^)      S(n^nj7n'^  -  m'J) 


*  Thus  the  mere  fact  of  breeding  from  two  or  three  individuals  selected  at  random  can  easily  produce 
a  correlation  between  organs  in  the  offspring,  which  has  no  existence  in  the  species  at  large. 
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II.  On  the  Inheritance  of  Fertility  in  Mankind.    By  Karl  Pearson,  F.R.S.,  and 

Alice  Lee,  B.A.,  B.Sc. 

(8.)  In  commencing  an  investigation  of  this  kind  where  the  results  to  be  expected 
were  quite  unknown  to  us,  but  where  we  had  reason  to  beheve  that  the  apparent 
strength  of  inheritance  must  be  very  small,  we  considered  that  the  first  thing  to  be 
done  was  to  investigate  the  largest  possible  amount  of  material.  Thus  the  probable 
errors  of  our  results  would  be  very  small  and  any,  however  small,  correlation  between 
fertility  in  parent  and  offspring  would  be  brought  to  light.  Attempts  might  then  be 
made  to  strengthen  any  correlation  discovered  by  removing  so  far  as  possible  one 
after  another  the  various  factors  tending  to  screen  the  full  effect  of  the  inheritance 
of  fertility. 

Such  factors  are  for  example  : 

(a.)  The  age  of  both  husband  and  wife  at  the  time  of  marriage.  The  real  fertility 
may  be  screened  by  late  marriages  of  one  or  both  parents.  The  relation  of  fertility 
to  age  at  marriage  has  been  dealt  with  by  several  writers,  notably  by  Duncan  and 
An  SELL. ''^ 

(6.)  The  duration  of  marriage.  The  data  may  be  taken  from  a  marriage  not  yet 
complete,  both  parents  being  still  alive.  Or  from  a  marriage  which  is  complete  one 
or  both  parents  being  dead.  In  the  former  or  the  latter  case  the  marriage  may  be 
complete  so  far  as  fertility  is  concerned,  i.e.,  details  of  offspring  may  be  available  till 
the  wife  has  reached  the  age  of  50  years,  which  for  statistical  purposes  may  be  taken 
as  an  upper  limit  to  fecundity. 

(c.)  Bestriction  of  fertility  during  marriage.  It  has  been  shown  in  a  paper  on 
Reproductive  Selectiont  that  there  is  evidence  of  the  sensible  influence  of  this  factor 
in  man.  It  tends  to  give  fictitious  values  to  the  fertility  of  the  younger,  rather  than 
the  elder  generation,  and  so  obscures  the  correlation. 

We  have  accordingly  two  problems  before  us  : 

(i.)  Supposing  these  and  other  factors  tending  to  screen  the  effects  of  reproductive 
selection  to  exist,  can  we  show  that  it  still  produces  sensible  effects  in  the  case  of 
man,  and  thus  demonstrate  that  fertility  is  really  inherited  ? 

(ii.)  Can  we  by  eliminating  these  factors  so  far  as  possible  obtain  a  lower  limit  to 
the  coefficient  of  heredity  in  the  case  of  fertility,  and  ascertain  whether  it 
approximates  in  value  to  what  we  might  expect  from  the  Law  of  Ancestral 
Heredity  ? 

The  first  impression  of  the  reader  may  be  that  it  is  only  needful  to  select  the 

*  J.  Mathews  Duncan,  'Fecandifcy,  Fertility,  Sterility  and  Allied  Topics,'  second  edition, 
Edinburgh,  1871.  Chaeles  Anbell,  Junr.,  '  Statistics  of  Families  in  the  Upper  and  Middle  Classes,' 
London,  1874. 

t  '  The  Chances  of  Death  and  other  Studies  in  Evolution,'  vol.  I,  pp.  77,  89. 
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fertility  of  marriages,  which  were  formed  with  husband  and  wife  between  20  and 
28*  say,  and  which  have  lasted  till  the  wife  is  over  50.  But  these  conditions  must 
be  true  in  tivo  successive  generations,  and,  had  we  adopted  them,  we  may  safely  say 
that  without  immense  labour  it  would  have  been  impossible  to  collect  even  a 
thousand  cases.  From  the  whole  of  the  peerage,  the  baronetage,  the  landed  gentry, 
a  variety  of  family  histories,  of  private  pedigrees,  and  a  collection  of  data  formed  of 
families  at  first  hand,  it  was  not  possible  to  extract  more  than  about  4000  cases  for 
the  inheritance  of  fertility  in  the  female  line,  when  the  limitations  were  far  less 
stringent,  being  applied  only  to  one  generation,  and  consisting  in  our  taking  marriages 
entered  into  at  any  time  of  life  for  either  husband  or  wife,  and  lasting  till  the  death 
of  one  member  or  for  at  least  fifteen  years.  Even  in  this  case  the  pedigree  of  the 
wife  had  to  be  sought  for  from  one  record  to  another  and  often  in  vain.  It  is  the 
male  pedigree  with  which  the  recorder  in  nearly  all  cases  occupies  himself. 

Only  those  who  l)ave  attempted  the  labour  of  extracting,  as  has  been  done  in  this 
case,  some  16,000  separate  returns,  will  fully  grasp  the  difficulty  of  making  the 
limitations  of  selection  more  and  more  complex  ;  the  quantity  to  be  obtained  becomes 
dangerously  small  and  the  labour  immensely  increases.  Even  could  with  time  and 
patience  a  sufficient  selection  of  ideal  cases  have  been  made,  it  does  not  follow  that 
the  result  would  be  satisfactory  ;  for,  we  should  have  made  a  narrow  selection,  and 
this  very  fact  might  indicate  that  possibly  we  have  been  selecting  one  grade  or  class 
of  fertility.  It  is  possible  that  the  less  fertile  are  the  weaker,  and  so  more  liable 
to  die  early ;  or  again  it  may  be  the  more  fertile  women  who  are  subjected  to  the 
more  frequent  risk  of  childbed,  and  thus  are  less  likely  to  appear  in  the  selection  of 
long  marriages.  Even  greater  or  less  risk  at  birth  may  be  an  inherited  character 
in  women,  and  may  not  unfairly  be  looked  upon  in  itself  as  a  factor  limiting  fertility 
naturally. 

Taking  these  points  into  consideration,  it  seemed  that  if  we  were  to  have  enough 
material  to  draw  conclusions  from  we  must  entirely  drop  all  attempt  to  classify  by 
age  of  parents  at  marriage.  We  might  make  some  limitations  but  they  must  not  be 
very  stringent ;  they  must  leave  room  for  an  increase  of  stringency  in  different 
directions,  so  that  we  could  roughly  appreciate  the  influence  of  the  screening  factors. 
Accordingly  our  plan  has  been  to  show  that  correlation  actually  does  exist  between 
parent  and  offspring  with  regard  to  fertility,  and  that  when  we  make  the  conditions 
more  stringent  the  correlation  increases  towards  the  value  indicated  by  the  law  of 
ancestral  heredity. 

(9.)  On  the  Inheritance  of  Fertility  in  Woman. — (i.)  Table  I.  gives  the  result  for 
4418  cases  of  the  fertility  of  a  mother  and  of  her  daughter.  These  were  extracted 
from  Foster's  '  Peerage  and  Baronetage,'  Burke's  '  Landed  Gentry,'  some  family 

*  As  Duncan  points  out,  an  early  marriage  on  the  average  means  an  earlier  cessation  of  fecundity;  a 
somev.liat  later  one  does  not  necessarily  connote  less  fertility. 
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histories  and  a  collection  of  family  data  drawn  from  private  pedigrees  and  other 
sources.  In  the  case  of  the  daughter,  no  marriage  was  taken  which  had  not  lasted 
at  least  15  years,  or  until  the  death  of  husband  or  wife.  In  the  case  of  the  mother 
no  limitation  whatever  was  made,  the  number  of  brothers  and  sisters  of  the  daughter, 
including  herself,  being  counted.  Weight  was  given  to  the  fertility  of  the  mother, 
for  every  possible  case  that  could  be  got  from  the  records  under  the  above  conditions 
was  extracted.  It  is  quite  possible  that  a  certain  proportion  of  offspring  dying  in 
early  infancy  have  not  been  entered  in  the  records. 

If  M„„  M,i  be  the  mean  fertilities  of  mother  and  daughter,  cr„„  cr^  their  standard 
deviations,  and  r,^,i  their  correlation,  we  found  : 

M,  =  3'494,  M,,  =  6-225, 

cr,  =  2-975,  o-„  =  3-052, 

=  -0418. 

Clearly  owing  to  the  near  equality  of  ctj,  and  o-,,,  the  regression  of  daughter's  on 
mother's  fertility  is  sensibly  equal  to  the  correlation. 

The  probable  error  of  r,„j  is  determined  by  the  formula  given  by  Pearson  and 
Filon'''  to  be  "0101,  or  r,,^,i  is  four  times  its  probable  error. 

We  thus  conclude  : 

(i.)  That  fertility  is  inherited  in  the  female  line. 

(ii.)  That  its  effects  are  very  largely  screened  by  the  factors  to  which  we  have 
previously  referred. 

Had  we  started  with  no  limitation  as  to  the  daughter's  family,  it  is  highly  probable 
that  r„„j  would  scarcely  have  been  sensible  relatively  to  its  probable  error,  and, 
therefore,  small  series  without  due  regard  to  screening  causes  may  easily  lead  the 
recorder  to  suppose  that  fertility  is  not  inherited. 

Supposing  we  exclude  from  the  daughters  the  775  barren  marriages,  we  find  the 
mean  for  3643  cases  of  fertile  marriages  to  be  5*237.  Comparing  this  fertility  with 
the  observed  fertility  6-225  of  mothers,  a  superficial  inquirer  might  at  once  consider 
that  a  diminution  of  fertility  has  taken  place.  The  fact  is  that  neither  of  the  results, 
\  M,„  or  M^j  gives  the  actual  fertility  of  the  mothers  or  daughters.  These  are  the 
means  M"i  and        of  formulae  (viii.)  and  (vii.)  of  the  theoretical  investigation. 

Let  us  apply  the  theory  developed  to  our  statistics.  In  the  first  place  we  note 
that  r  is  small ;  hence  is  still  smaller,  and  thus  by  (xv.)  a'^  will  not  differ  much 
from  (To.  Since  ar'\  will  be  generally  less  than  ctj  by  (xvi,),  it  follows  that  a" 2  will 
probably  be  less  than  0-0.  Approximately,  we  can  take  cr.,  =  3.  Turning  to  (vii.)  we 
see  that  M"o  cannot,  since  r  is  small,  differ  widely  from  M2.    If  there  be  no  secular 


*  "  Contributions  to  Theory  of  Evolution.— IV."    '  Phil.  Trans.,'  A,  vol.  191,  p.  242. 
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evolution  in  the  real  fertility  sensible  in  the  one  generation,  then  Mj  would  equal 
Mo.    Hence  to  a  first  approximation  we  should  have  : 

Ml  =  M,  =  M"2  =  3-494. 

To  obtain  a  second  approximation  we  may  substitute  this  in  the  small  terms  of 
(vii.).    Here  cr'l  must  be  found  from  (ii.)  ;  neglecting  the  cubic  term  we  have  : 

o-'i/o-r  =  1  -  a-ymi  =  -2628. 

Hence  : 

M.  =  M3  =  3-494  + 

=  3-494  —  -0418  X  2-5759  X  I'lSlS 
=  3-494  — -124  =  3-370. 

We  can  now  substitute  this  value  of  Mi  in  (viii,),  and  we  find  : 

M"i  =  3-370  +  2-980  =  6-350. 

This  differs  comparatively  little  from  the  actually  observed  value,  0*225,  and  is 
satisfactory  evidence  of  the  validity  of  our  theory.  The  fact  that  the  elder  generation 
was  in  no  way  limited  like  the  younger,  and  that  we  have  neglected  the  third 
moment — although  fertility  distributions  are  never  normal — as  well  as  made  other 
approximations,  is  quite  sufficient  to  account  for  the  difference  observed. 

We  may  take  it  that  3-4  is  practically  the  fertility  of  the  elder  generation,  and 
that  this  is  raised  to  about  3-5  by  reproductive  selection  in  the  younger  generation. 
The  result  6-2  for  the  elder  generation  is  thus  purely  a  result  of  weighting  due  to  the 
nature  of  the  record. 

(ii.)  Table  II.  gives  the  result  of  1000  cases  taken  from  the  Peerage.  Here  the 
conditions  of  extraction  were  as  follows  : — 

One  member  only  was  taken  out  of  each  family,  or  no  weight  was  given  to  the 
fertility  of  mothers. 

The  daughters'  marriages  had  all  been  completed  by  the  death  of  one  parent  or  had 
lasted  at  least  15  years. 

There  was  no  limitation  with  regard  to  the  parents'  marriages.    We  found  : 

M,  =  3-923,  M„  =  5-85G, 

=  2-758,  cr„  =  2-751, 

r,„,  =  -2096. 

The  coefficient  of  regression  is  sensibly  equal  to  that  of  correlation.  The  probable 
error  of  r^,,  =  '0204,  or  not  a  tenth  of  the  value  of       itself    Again  we  conclude 
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that  fertility  is  certainly  inherited  in  the  female  line.  By  selecting  fairly  homogeneous 
material  with  a  more  definite  and  complete  record  than  exists  for  the  heterogeneous 
material  of  the  previous  case,  we  have  carried  up  the  correlation  to  five  times  its 
previous  value,  and  within  a  reasonable  distance  of  the  value  '3  which  would  be 
required  by  the  law  of  ancestral  heredity.  The  homogeneity  of  our  material  is 
evidenced  by  the  reduction  in  both  standard  deviations  ;  the  greater  completeness  of 
the  record  by  the  rise  in  the  fertility  of  daughters  ;  and  the  non-weighting  of  the 
fertility  of  mothers  by  the  fall  in  their  mean  fertility. 

If  the  reader  will  turn  back  to  the  theory  of  the  influence  of  heterogeneity  on 
correlation  in  section  (7),  he  will  notice  that  the  expression  in  (xxxiv.)  will  be  negative, 
and  therefore  the  apparent  correlation  less  than  the  real,  if  we  form  a  mixture  of  two 
gi'oups  in  which  'ni^,  >  m,^  and  m'^  <  m'^.  Now  the  entries  of  women  in  the  Landed 
Gentry  and  other  records  are  very  often  entries  of  "  heiresses,"  while  the  entries  of 
women  in  the  Peerage  are  entries  because  of  class.  An  "  heiress"  naturally  has  fewer 
brothers  and  sisters  than  another  woman  on  an  average,  or  we  may  expect  >  -m'^,. 
On  the  other  hand  an  "  heiress  "  need  not  have  fewer  children  than  other  women, 
unless  her  heritage  is  the  result  of  her  coming  from  an  infertile  stock,  and  is  not  a 
result  of  the  incompleteness  of  her  parents'  marriage.  If  she  belongs  to  a  somewhat 
lower  social  grade,  she  may  possibly  be  more  fertile  than  the  average  of  a  higher 
social  grade.  In  this  case  m^,  will  be  >  7n,^,  and  when  we  come  to  mix  records  of  the 
Peerage  with  those  of  the  Landed  Gentry  and  Family  Histories,  we  need  not  be 
surprised  to  find  the  correlation  of  fertility  much  weakened,  as  it  undoubtedly  is  (as 
shown  by  (i.)  and  (ii.)  above)  by  the  mixture. 

Let  us  next  apply  our  theory  to  the  above  results.  We  are  now  dealing  with 
M'l,  M'2,  cr'i,  (t'z-  Assuming  that  there  is  no  secular  change  cti  =  a.^-  and  accordingly 
since  cr'i  =  a-'-,  sensibly,  formula  (xi.)  shows  us  that  both  =  cti. 

Further,  if  Mi  =  jVL,  formula  (v.)  is  a  quadratic  equation  to  find  Mj ;  substituting 
for  M'l,  (Tj,  and  r,  we  have,  on  solving  and  taking  the  only  admissible  root.  Mi  =:  3"4625, 
Then,  applying  formula  (i.)  to  find  M'l,  we  have  : 

M'l  =  5'6G0. 

This  is  not  quite  as  high  as  the  observed  value  5 "850,  but  it  sufiices  to  show  that 
our  theory  expresses  the  main  facts.  In  all  probability  we  have  not  entirely  freed 
our  results  from  weighting  with  fertility  ;  because,  although  every  endeavour  was 
made  to  take  only  one  from  each  family,  it  is  possible  that  pairs  of  sisters  have  occa- 
sionally crept  into  the  record. 

(iii.)  Table  III.  gives  the  result  of  1000  cases  taken  from  the  Landed  Gentry,  As 
we  have  already  noted,  the  women  recorded  are  largely  "  heiresses,"  and  we  beheved 
this  might  be  one  of  the  chief  sources  of  the  heterogeneity  of  the  material  in  Table  I. 
The  conditions  of  selection  were  made  somewhat  more  stringent,  and  were  as  follows  : — 
Only  one  daughter  was  taken  from  each  family,  and  her  marriage  must  have  lasted  at 
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least  15  years.  No  limitation  Avas  placed  on  the  duration  of  the  parents'  marriacre. 
We  found  : 

=  4-232,  M„  =  5-403, 

cr,  =  3-292,  cr^  =  3-241, 

=  -1045. 

The  probable  error  of  r,„^  =  -0211,  and  again  we  see  that  fertility  is  certainly 
inherited.  The  correlation  has,  however,  sunk  ;  probably,  as  the  great  increase  of 
variation  indicates,  because  we  are  dealing  with  much  more  heterogeneous  material  than 
in  the  case  of  the  Peerage.  While  the  selection  of  "  heiresses  "  has  largely  reduced 
the  number  of  brothers  and  sisters,  i.e.,  the  fertility  of  mothers,  the  limitation  to 
marriages  of  at  least  15  years  has  increased  the  apparent  fertility  of  daughters;  nor 
is  this  increase  at  all  balanced  by  the  fact  that  heiresses  come  from  small  families,  and 
may,  therefore,  be  supposed  to  be  the  children  of  rather  sterile  mothers.  The  average 
number  of  children  of  heiresses  is  sensibly  as  large  as  the  average  number  of  children 
of  women  who  are  not  in  the  bulk  heiresses,  and  who  have,  as  in  the  following  case, 
been  selected  with  the  same  condition  as  to  duration  of  marriage.  The  fact  is  that 
heiresses  are  not  on  the  whole  the  children  of  sterile  mothers ;  their  high  fertility  and 
their  small  correlation  tvith  their  mothers  shows  us  that  heiresses  in  the  bulk  are 
rather  the  daughters  of  mothers  whose  apparent  fertility  is  fictitious.  They  have, 
owing  to  the  sterility  or  early  death  of  their  husband,  to  their  own  marriage  late  in 
life,  or  to  some  physical  disability,  or  other  restraint,  never  reached  their  true  fertility. 
If  this  conclusion  be  correct,  and  a  comparison  of  the  values  of  M^^  and  r,,,^  in  this  and 
the  following  cases  thrusts  it  almost  irresistibly  upon  us,  then  we  see  that,  the 
argument  against  the  inheritance  of  fertility  based  upon  the  fertihty  of  heiresses  and 
non-heiresses  is  of  no  validity.^*  It  could  not  be  valid  as  against  the  values  of  the 
correlation  Ave  have  found,  but  the  present  investigation  shows  by  the  value  of  r„j 
exactly  wherein  the  error  lies  :  the  heiress  is  not  infertile,  but  is  the  daughter  of  a 
fictitiously  infertile  mother. 

Applying  our  theory  to  this  case,  we  find  from  formula  (xi.),  putting  o-j  =  0-2  : 

o-'l  =  (o-'i  —  r-a''i)l{l  —  r-), 

whence  we  find  (Tj  =  3'293,  a  result  sensibly  identical  with  0*^,  Solving  the  quadratic 
(v.)  with  Ml  =  M2  to  find  Mi  we  find  : 

Ml  =  3-952. 

Hence  by  (i.)  we  have  : 

M',  =  6-838, 

the  actually  observed  value  being  5 '403.  Thus  the  theory  completely  fails  to  give 
the  fertility  of  the  heiresses'  mothers  ;  for  such  a  fertility  as  we  find  in  the  daughters, 

*  Sec,  for  example,  a  recent  letter  of  Mr.  Hovtakd  Collixs  in  '  Nature,'  November  3,  1898. 
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the  mothers'  fertility  is  far  too  low.  This  again  emphasises  the  point  we  have 
already  referred  to.  The  peculiar  character  of  the-  selection,  which  leads  to  the 
female  record  in  the  Landed  Gentry,  is  not  one  such  as  we  have  considered  in  our 
theory,  where  the  record  of  any  family  is  likely  to  appear  in  proportion  to  its  size. 
Such  a  distribution  is  a  chance  distribution,  but  a  selection  of  women  inheriting  land 
has  not  this  character,  and  a  woman  who  is  the  mother  of  co-heiresses  is  hardly 
doubly  as  likely  to  appear  as  the  woman  who  is  mother  of  one.  A  marriage  in  either 
case  is  likely  to  be  arranged,  and  if  we  take  only  one  daughter  from  each  family  the 
record  will  not  already  have  weighted — at  any  rate  to  the  full  extent — every  mother 
with  her  fertility.  If  the  reader  will  compare  the  variation  columns  for  both  daughters 
and  mothers  in  Table  II L  with  the  corresponding  columns  in  Table  II.  or  Table  IV., 
he  will  at  once  see  how  anomalous  is  the  selection  of  women  given  in  the  Landed 
Gentry. 

(iv.)  Table  IV.  gives  the  results  for  1000  cases  taken  from  the  Peerage  and 
Baronetage  under  the  following  limitations  :  one  daughter  only  was  taken  for  each 
mother,  and  in  the  case  of  both  mother  and  daughter  the  marriage  must  have  lasted 
at  least  1 5  years.    We  found  : 

=  4-335,  M„  =  5-898, 

o-d  =  2-967,  o-„;  =  2-830, 

r„,,  =  -2130. 

The  probable  error  of  r^^  =  "0204.  Thus,  as  it  is  now  hardly  necessary  to  repeat, 
fertility  is  certainly  and  markedly  inherited.  The  regression  coefficient  is  now  as 
high  as  -2233,  the  closest  limit  we  have  yet  reached  to  the  theoretical  '3  of  the  law 
of  ancestral  heredity. 

Owing  to  the  limitation  to  marriages  of  15  or  more  years,  the  means  of  the 
fertilities  of  both  mothers  and  daughters  have  risen,  in  the  latter  case  more,  how- 
ever, than  the  former.  It  might  have  been  expected  that  the  fertility  of  mothers 
would  have  risen  more,  but  it  must  be  remembered  that  M,„  is  the  apparent  and  not 
the  real  fertility  of  mothers  ;  and  further,  since  the  record  largely  weights  the  more 
fertile  women,  the  bulk  of  the  mothers  are  already  those  with  large  families,  i.e., 
those  whose  marriages  have  lasted  at  least  15  years. 

Assuming  that  there  is  no  sensible  secular  change  in  unweighted  fertility,  i.e., 
tTi  =  (To,  we  have  from  the  formula  on  p.  284  : 

0-1  =  2-973. 

From  (v*)  with  Mi  =  M,  we  find : 

Ml  =  3-845 

for  the  real  fertility  of  mothers.  This  is  a  sensible  increase  on  the  value  3*463  given 
in  Case  (ii.),  in  which  there  was  no  minimum  duration  to  the  length  of  the  mother's 
marriage. 
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Applying  formula  (i.)  we  find: 

M'i  =  6-144, 

whicli  is  somewhat  more  than  the  observed  value  5*898.  The  reason  for  this  lies, 
we  think,  in  the  difficulty  already  referred  to  on  p.  263.  If  we  start  extracting 
mothers,  it  is  often  difficult  to  follow  the  daughter's  history  ;  starting  with  the 
daughter  it  is  much  easier,  although  still  laborious,  to  trace  back  her  ancestry,  and 
find  the  number  of  her  brothers  and  sisters.  Even  in  this  case  the  search  may  be 
lengthy.  But  as  daughters  when  married  change  their  name,  it  requires  great  care 
in  extracting  large  quantities  to  be  sure  that  a  mother  is  not  repeated,  i.e.,  some 
approach  made  to  weighting  her  with  her  fertility.  Every  care  was  made  in 
extracting  the  records,  but  we  cannot  hope  to  have  always  avoided  weighting  to 
some  extent  a  mother,  and  if  this  be  done  we  shall  have  a  transition  from  formulae 
(xi.),  (v.),  and  (i.)  towards  formulsB  (xv.),  (vii.),  and  (viii.),  which  would  well  account 
for  the  difi'erence  found  between  theory  and  observation. 

If  we  sum  up  for  inheritance  of  fertility  in  the  female  line  on  the  basis  of  these 
four  cases,  we  draw  from  each  one  of  them  the  unquestionable  result  that  fertility  in 
woman  is  an  inherited  character.  Further,  the  more  w^e  remove  causes  of  fictitious 
values  for  the  fertility  in  either  generation,  the  closer  does  the  value  approach  that 
I'equired  by  the  law  of  ancestral  heredity.  The  two  chief  disturbing  factors  which 
we  have  not  been  able  to  eliminate  are  [a.)  the  age  at  which  marriage  is  entered 
upon,  (Jj.)  restraint  giving  a  fictitious  value  to  the  fertility.  Both  these  causes  must 
give  a  lessened  value  to  the  correlation  of  fertility  between  mother  and  daughter, 
and  the  first,  judging  from  the  great  influence  of  age  at  marriage  on  fertility,  cannot 
fail  to  give  a  serious  diminution.  Hence  if  we  find  the  regression  coefficient  as  high 
as  •2233,  when  we  neglect  these  factors,  it  is  no  stretching  of  facts  to  conclude  that 
it  would  in  all  probability  rise  to  '3  could  we  take  them  into  account. 

Our  conclusion,  therefore,  is  that  fertility  in  woman  is  certainly  inherited  through 
the  female  line,  and  most  probably  according  to  the  law  of  ancestral  heredity. 
Beproductive  selection  is  actually  a  vera  causa  of  progressive  change,  but  its 
influence  is  largely,  if  not  entirely  screened  by  the  numerous  fiictors  tending  to  make 
the  apparent  fertility  of  women  difl'er  from  their  real  or  potential  fertility. 

(10.)  On  the  Inlieritanoe  of  FertiliLy  in  Man. 

(i.)  While  many  of  the  difficulties  involved  in  the  extraction  of  data  for  women 
still  exist  for  man,  a  new  and  important  feature  tending  to  screen  the  full  influence 
of  the  law  of  ancestral  heredity  arises  in  his  case.  The  full  fertility  of  the  husband 
is  not  in  the  average  case  at  all  approached  in  the  case  of  monogamic  marriage. 
Hence,  in  considering  the  size  of  a  man's  family  as  a  measure  of  his  fertility  we 
are  measuring  a  character  which  differs  largely  from  the  character  of  fertility  in 
woman.  It  is  only  in  the  case  of  sterile  or  even  very  sterile  men  that  there  is 
likely  to  be  a  correlation  shown  between  the  sizes  of  the  families  of  fathers  and  sons. 
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The  intensity  or  duration  of  fecundity  in  the  husband  must,  one  or  other,  be  less 
than  that  of  the  wife, — and  this  .will  hardly  be  so  in  the  great  run  of  cases — if  his 
family  is  to  be  in  any  way  a  measure  of  his  fertility,  or,  as  it  might  be  better 
to  call  it  in  this  case,  his  sterility.  We  are  seeking  to  find  a  correlation  between 
two  characters,  one  in  father  and  one  in  the  son,  neither  of  which  we  can  measure 
unless  they  fall  short  of  a  certain  limit.  The  result  is  that  our  correlated  material  is 
weakened  down  by  the  admixture  of  a  mass  of  uncorrelated  material  in  the  manner 
indicated  in  Proposition  V.  of  the  theoretical  part  of  this  investigation.  Within  the 
family  we  cannot  hope  to  get  a  correlation  which  will  approach  that  indicated  by  the 
law  of  ancestral  heredity.  We  may  still,  however,  hope  to  ascertain  whether 
fertility,  respectively  sterility,  is  an  inherited  character  in  man  as  well  as  woman. 

(ii.)  Our  first  attempt  was  to  collect  as  much  material  as  possible,  so  that  our 
limitations  were  few.  The  Peerage,  Baronetage,  Landed  Gentry,  Family  Histories, 
private  pedigrees,  and  collected  data  provided  the  6,070  cases  arranged  in  Table  V. 
Here  large  families  were  weighted  because  several,  where  available,  were  taken  from 
one  family.  The  son's  marriage  must  either  have  lasted  till  the  death  of  one  partner 
or  at  least  15  years;  there  was  no  condition  as  to  the  duration  of  the  father's 
marriage. 

We  have  spoken  of  the  correlation  between  fertility  of  father  and  son,  but  since 
only  a  single  marriage  of  the  father  is  taken,  it  may  be  equally  well  termed  a 
correlation  between  the  fertility  of  the  mother  and  son,  which  may,  perhaps,  to  some 
extent  explain  the  relatively  high  values  reached. 

Let  M^,  o-j  be  the  mean  and  standard-deviation  of  the  son,  cr^,  of  the  parent, 
and      the  correlation  ;  then  we  found  : 

M,  =  3-871,      M^=  5-831, 
0-,  =  3-003,        o-,,  =  3-190, 
r,^,  =  -0514. 

The  probable  error  of  r,^  =  '0087.  Thus  the  correlation  is  nearly  six  times  the 
probable  error,  or  fertility  in  man  is  certainly  inherited. 

(ili.)  Table  VI.  contains  the  result  of  extracting  1,000  cases  from  the  Peerage,  only 
one  son  being  taken  from  each  family,  and  his  marriage  having  lasted  at  least  15 
years.    No  attention  was  paid  to  the  length  of  parents'  marriage. 

We  found  : 

M,  =  5-070,      M,,  =  5-827, 
cr,  =  2-910,       (Xp  =  3-142, 
r,p  =  -0656. 

The  probable  error  of  r,p  =  -0212.  This  case  closely  confirms  the  previous  case; 
M.p  and  a-p  remain  sensibly  the  same,  M,  has  risen  owing  to  the  longer  period  of 
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duration  of  the  son's  marriage,  and  since  there  is  a  longer  period  for  the  possible 
exhaustion  of  the  male  fertility,  we  find  r,^  is  slightly  larger.  Although  the  numbers 
are  smaller  than  in  Case  (i.),  the  probable  error  is  not  so  large  but  that  we  can  still 
assert  an  inheritance  of  fertility  in  man. 

(iv.)  Lastly,  to  compare  with  Case  (iii.)  for  women,  1000  cases  were  extracted 
from  the  Landed  Gentry,  and  are  given  in  Table  VII.  Here  no  marriage  of  the  son 
or  parents  was  taken  under  a  minimum  of  15  years'  duration,  and  only  one  son 
taken  from  each  family,    We  found  : 

M,  =  5-304,  =  6-272, 

=  2-951,       o-^  =  2-911, 
r,^,  =  -1161. 

Thus  the  longer  duration  of  the  marriage,  which  gives  a  greater  chance  for  the 
exhaustion  of  the  fertility  of  a  partially  sterile  father,  leads  to  an  increased  corre- 
lation.   The  probable  error  here  is  -0210,  and  the  correlation  is  thus  unquestionable. 

It  would  be  idle  to  apply  the  theory  before  developed  to  these  male  cases,  for  the 
simple  reason  that  we  must  certainly  look  upon  them  as  containing  a  large  proportion 
of  uncorrelated  material.  But  they  suffice  to  show  that  male  fertility  is  an  inherited 
character,  and  although  the  results  are  widely  different  from  those  indicated  by  the 
law  of  ancestral  heredity,  they  are  large  when  we  consider  how  little  male  fertility 
appears  measurable  by  the  results  of  monogamic  marriage.  Were  an  approximately 
close  measure  of  male  fertility  available,  there  is  certainly  in  the  above  results  no 
reason  to  induce  us  to  believe  that  it  would  not  be  found  to  obey  the  law  of  ancestral 
heredity. 

(11.)  On  the  Inheritance  of  Fertility  in  Woman  through  the  Male  Line. 

Although  we  are  not  able  to  measure  the  potential  fertility  of  the  male,  we  are 
able  to  determine  whether  he  transfers  fertility  from  his  mother  to  his  daughter. 
This  may  be  simply  done  by  correlating  the  fertility  of  a  woman  and  that  of  her 
paternal  grandmother.  This  problem  belongs  to  an  important  class — namely, 
questions  as  to  the  extent  to  which  a  sexual  character  is  inherited  through  the 
opposite  sex.  Darwin  has  touched  upon  this  "  transmission  without  development "  in 
Chapter  viii.  of  the  '  Descent  of  Man,'*  and  we  shall  find  his  views  amply  verified. 

The  problem  before  us  is  :  Does  a  woman  have  as  close  correlation  with  her  paternal 
as  with  her  maternal  grandmother  in  the  matter  of  fertility  ? 

To  solve  this  problem  1000  cases  were  taken  out  of  the  Peerage  for  the  fertility  of 
a  woman  and  of  her  paternal  grandmother.  The  marriages  of  the  woman  and  of  her 
grandmother  were  both  taken  with  a  minimum  duration  of  fifteen  years.  Every  care 
was  taken  that  no  weight  should  be  given  to  fertile  families  by  taking  only  one  out 


*  Second  Edition,  p.  227,  '  Laws  of  Inheritance,' 
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of  each  family,  but,  of  course,  the  difficulty  of  avoiding  this  is  increased  when  a 
pedigree  must  be  traced  through  three  instead  of  two  generations. 

If  d  denotes  granddaughter,  g  grandmother,  the  following  results  were  obtained 
(Table  VIII.) 

M„=  4-411,         M„=  5-657, 
o-,,  =  2-897,  CT,  =  3-056, 

r„,=  -1123. 

The  coefficient  of  regression  of  daughter  s  fertility  on  grandmother's  fertility  =  -1065. 
The  probable  error  of      =  -0211. 

According  to  the  law  of  ancestral  heredity""  we  should  expect  the  grandparental 
correlation  and  regression  to  be  half  the  parental  and  equal  to  -15.  Comparing  the 
present  result  with  Case  (iv.),  we  see  that  -1123  and  -1065  have  to  be  compared  with 
i(-2130)  and  l(-2233),  or  with  -1065  and  -1116.  These  are  differences  well  within 
the  probable  error  of  our  results,  or  we  may  conclude  that  the  correlation  of  a  woman 
with  her  paternal  grandmother  is  exactly  what  from  Case  (iv.)  of  Section  (9)  we 
should  expect  to  find  for  her  correlation  with  her  maternal  grandmother.  The  reduc- 
tion from  -15  to  -1123  is  just  what  we  might  have  predicted  after  the  maternal 
reduction  from  -3  to  '2130,  We,  therefore,  conclude  that  the  fertility  of  woman  is 
inherited  through  the  male  line  with  the  same  intensity  as  through  the  female,  and 
this  intensity  is  most  probably  that  which  would  be  indicated  by  the  law  of  ancestral 
heredity. 

(12.)  We  do  not  stay  to  consider  many  points  which  flow  from  our  tables,  such,  for 
example,  as  the  amount  of  restraint  indicated  by  the  hump  at  the  start  of  our  various 
frequency  distributions  for  size  of  families,  partly  because  such  consideration  would 
lead  us  beyond  our  present  scope,  the  inheritance  of  fertility,  and  partly  because  this 
point  has  been  already  dealt  with  by  one  of  us  in  a  paper  on  '  Reproductive  Selection.' 
We  consider  that  we  have  shown  fertility  in  mankind  to  be  an  inherited  character  in 
both  lines,  and  probably  obeying  the  law  of  ancestral  heredity .t  By  aid  of  our 
theoretical  investigations  it  is  clear  that  the  average  size  of  a  family  (Mj),  as  deduced 
from  our  record  data  (M'l  or  M"i),  is  about  3-5  children,  if  the  marriage  lasts  till  the 
death  of  one  partner,  or  at  least  till  15  years  ;  it  is  about  3-9  to  4  children  if  the 
duration  of  the  marriage  is  at  least  15  years.  Reproductive  selection  would  increase 
this  average  by  about  -5  child  per  generation  were  its  influence  not  counteracted 

*  "Mathematical  Contributions  to  the  Theory  of  Heredity,  on  the  Law  of  Ancestral  Heredity," 
'  Roy.  Soc.  Proc.,'  vol.  62,  p.  397. 

t  In  the  paper  on  "  The  Law  of  Ancestral  Heredity  "  ('  Roy.  Soc.  Proc.,'  vol.  62,  p.  412)  it  is  stated 
that  fertility  is  probably  inherited,  but  the  amount  falls  below  that  which  would  be  indicated  by  the 
law  of  ancestral  heredity.  At  that  time  only  Case  (i.)  of  Section  (9)  and  Case  (i  )  of  Section  (10)  had 
been  worked  out  in  detail.  It  is  the  rise  of  correlation  with  more  stringent  limitation  of  opposing 
influences,  which  suggests  that  after  all  that  law  is  true  for  fertility  as  for  other  characters. 
VOL.  CXCTI. — A.  2  r 
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by  a  variety  of  other  factors  of  evolution.  These  factors  are  so  active  that  the 
influence  is  redaced  to  "12  of  a  child  per  generation  if  we  take  Case  (i.)  of  Section  (9), 
and,  we  have  little  doubt,  would  be  practically  insensible  did  we  take  all  marriages 
without  any  limitation  whatever.  Reproductive  selection  must,  therefore,  be  looked 
upon  as  always  tending  to  increase  the  fertility  of  a  race  ;  races  are  not  only  ever 
tending  to  increase,  but  tending  to  increase  the  rate  at  which  they  increase — a  feature 
not  recognised  by  Malthus,  but  which  strengthens  certain  of  his  arguments.  So 
soon,  therefore,  as  environment,  or  other  circumstance,  relieves  the  pressure  of 
opposing  factors,  a  race  will  not  only  increase  in  numbers,  but  also  in  fertility.  It  is 
this  inherited  character  of  fertility,  and  its  constant  tendency  to  change  unless  held 
in  check  by  natural  selection  or  other  factor  of  evolution,  which  seems  to  us  the 
source  of  the  immense  diversity  in  fertility  to  be  observed  not  only  in  different  species, 
but  in  local  races  of  the  same  species. 

III.  On  the  Inheritance  of  Fecundity  in  Thoroughbred  Racehorses.     By  Karl 
PEARSOisr,  F.KS.,  with  the  assistance  of  Leslie  Bramley-Moore.^ 

(13.)  The  data  provided  for  the  fertility  of  thoroughbred  racehorses  by  the  stitd- 
hooJcs,  are  of  a  kind  which  cannot  be  hoped  for  except  in  the  cases  of  pedigree  animals 
kept  for  breeding  purposes,  and  of  specially-arranged  experiments  on  insects,  &c. 
We  have  a  practically  complete  record  of  the  stud-life  of  every  brood-mare.  The  sire 
by  whom  she  has  been  covered  in  each  year  is  stated,  and  the  result,  barren,  dead 
foal,  living  filly  or  colt,  twins,  &c.,  can  be  ascertained.  It  is  also  possible  to  find  out 
whether  the  foal  dies  young,  say  as  a  yearling.  By  examining  the  whole  series  of 
stud-books  the  complete  pedigree  of  any  mare  or  sire  can  nearly  always  be  found, 
and  the  correlation  theoretically  worked  out  for  almost  any  degree  of  relationship. 

In  starting  an  investigation  of  this  kind  on  such  a  great  mass  of  raw  material, 
it  is  necessary  to  draw  up  certain  rules  for  the  extraction  and  arrangement  of  data. 
These  rules  must  be  prepared  without  any  definite  knowledge  of  the  character  of  the 
material  in  hulk,  for  this  can  only  be  found  after,  perhaps,  some  1000  cases  have 
been  extracted  and  worked  out.  Hence  the  rules  originally  adopted  are  often  not 
such  as  an  investigator  would  have  arranged  had  he  known  beforehand  the  general 
character  of  the  conclusions  he  would  reach.  But  the  statistician  cannot,  like  the 
experimental  physicist,  modify  without  immense  labour  his  methods  and  repeat  his 
experim^ent.  The  collection  of  his  data  has  frequently  been  far  too  laborious  a  task 
for  repetition.    His  raw  material  has  been  prepared  in  a  certain  manner;  he  may 

*  During  the  tliree  years  in  wliicli  this  investigation  has  been  in  progress,  a  considerable  number  of 
friends  have  given  me  substantial  aid  in  the  arithmetical  work,  or  in  the  preparation  of  the  6^000 
pedigree  cards  on  which  the  results  are  based.  Mr.  BRAMLEY-MooliE  has  latterly  been  my  chief  helper, 
but  I  am  also  much  indebted  to  Miss  Alice 'Lee  and  Mr.  G.  U.  Yule.  Miss  Margaret  Shaen  and  Miss 
LiNA  EcKKMSTEiN  have  also  contributed  to  the  labour  of  extracting  the  raw  data  from  the  stud-books. 
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sort  and  rearrange  his  data  cai'ds  in  a  variety  of  ways,  but  to  prepare  new  cards  on 
a  different  system  is  practically  beyond  his  powers. 

These  remarks  are  made  in  order  to  meet  criticism  of  the  method  in  wliich  my 
data  cards  were  prepared.  I  could  now  possibly  extract  more  convenient  data,  but 
that  is  only  because  of  the  knowledge  gained  in  the  process  of  examining  the 
fecundity  of  several  thousand  horses.  I  did  not  even  know,  ah  initio,  the  extent  of 
variability  in  equine  fertility;  I  did  not  even  know  the  immense  preponderance 
which  would  have  to  be  given  to  certain  sires,  at  any  rate  I  had  no  numerical 
estimate  of  it.  Nor  had  I  an}^  percentage  of  the  number  of  cases  in  which  a 
pedigree  might  end  abruptly  with  an  alternative  sire.* 

I  saw  at  once  that  the  apparent  fertility  of  racehorses  was  even  less  close  to  their 
potential  fertility  (which  I  presume  to  be  the  inherited  character)  than  in  the  case 
of  man.  Mares  go  at  different  ages  to  the  stud,  they  remain — for  reasons  not 
stated — uncovered  for  occasional  years,  or  periods  of  years  ;  they  return  to  the 
training  stable  for  a  time ;  they  are  sold  abroad  ;  they  are  converted  into  hunters, 
put  into  harness,  or,  as  is  occasionally  recorded,  sold  to  cab  proprietors.  This  by  no 
means  invariably  denotes  that  their  fertility  is  exhausted  ;  their  offspring  may  be 
bad  racers,  or  their  stock  unfashionable.  Very  frequently  also  we  find  the  mare  put 
to  a  cart-horse  stallion  for  a  year,  a  few  years,  or  for  the  remainder  of  her  career, 
and  then  no  record  at  all  is  given  of  the  result.  Thus  the  total  fertility  recorded 
can  have  but  small  correlation  with  the  potential  fertility,  and  I  was  compelled  to 
deal  with  fecundity.  The  insufficiency  of  the  apparent  fertilities,  as  recorded  in  my 
mare  index,  to  solve  the  problem,  may  be  illustrated  in  the  following  manner  : 
1100  cases  of  the  apparent  fertilities  of  mares  and  dams  having  had  at  least  four 
coverings  were  tabulated  (Table  IX.).  The  following  results  were  calculated  from 
this  table,  the  subscript  m  referring  to  mare  and  d  to  dam  : — 

M, 

The  probable  error  of  r„„;  =  '0202,  and  thus  we  might  argue  that  a  fertile  dam 
has,  on  the  average,  infertile  offspring.  But  an  examination  of  the  above  numbers 
shows  us  that  the  dams  are  more  variable  than  the  mares,t  and  yet  the  dams  have 
been  theoretically  subjected  to  the  greater  selection,  for  they  must  all  be  granddams, 
or  the  fertility  of  the  mares  could  not  have  been  recorded.  We  are  forced  to 
conclude  that  the  mares  have  been  in  some  manner  selected,  and  the  form  of  tlie 
selection  is  fairly  obvious  on  examining  the  table.    There  appears  a  great  defect  of 

*  Even  the  pedigree  of  sncli  a  famous  racehorse  as  Gladiateur  is  soon  checked  by  the  occuri-euce  of 
alternative  sires.    His  sire,  Monarque,  was  the  son  of  either  The  Baron,  or  Stiug,  or  the  Emperor, 
t  The  variability  of  mares,  as  a  whole,  not  separated  into  njai  cs  and  dams,  is  (see  Art.  lli)  o'2775. 

2  P  2 


=  7-6655, 
=  3-3652, 


M,,  =  6-1391, 
a-,,,  =  3-1617, 


=  —  -0868. 
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mares  in  the  third  quadrant,''  i.e.,  of  mares  and  dams  of  large  fertihty,  the  frequency- 
is  cut  off  abruptly  in  this  quadrant.  The  reason  for  this  is  fairly  clear.  We  have 
dealt  with  a  limited  number  of  years,  about  30,  of  horse-breeding  ;  hence,  when  the 
dam  has  a  long  record,  her  later  offspring  at  any  rate  cannot  possibly  have  a  long 
one  ;  when  she  has  a  short  one,  it  is  possible  for  them  to  have  a  long  one.  Accord- 
ingly, there  has  been  a  process  of  unconscious  selection,  which  has  led  to  a  negative 
correlation  of  the  apparent  fertilities. 

To  illustrate  the  point  further,  two  more  correlation  coefficients  were  obtained.  In 
Table  X.  are  given  the  apparent  fertilities  of  mares  and  their  dams  with  a  minimum 
of  eight  coverings.    We  find  : 

M,  =  8-619],         M,„=  7-6309, 
o-,i  =  3-1656,         cr,„  =  2-8149, 
r„„,  =  -  -0876. 

The  probable  error  is  again  about  '0202.  While  the  mares  now  form  a  group  with 
their  mean  fertility  almost  equal  to  that  of  the  dams  in  the  previous  result,  their 
variability  is  markedly  less.  Relatively  to  the  dams  its  reduction  is  even  greater. 
The  correlation  is  sensibly  the  same.  It  would  thus  seem  that  the  anomalous 
selection  of  mares  which  thus  reduces  their  variability  so  markedly  below  that  of 
the  dams  is  not  in  the  low  fertilities. 

I  now  removed  from  the  Table  IX.  all  parts  of  it  concerning  mares  with  a  fertility 
greater  than  8  ;  867  mares  and  dams  remained  with  a  minimum  limit  of  four 
coverings,  the  mares  not  having  a  greater  fertility  than  8  offspring.    I  found : 

M;,  =  7'76S6,         M,„  =  4-8558, 
a-a  =  3-3983,         a,,,  =  1-9887, 
r  =  —  -0190. 

The  probable  error  of  r  =  '0229.  Now  the  line  of  regression  for  dams  on  mares 
ouffht  to  be  the  same,  whether  we  obtain  it  from  this  result  or  from  the  first  results 
in  which  mares  with  more  than  8  offspring  are  included.  Yet,  in  this  case,  there  is 
no  sensible  correlation  at  all.  In  other  words,  if  we  exclude  the  data  for  large 
fertilities,  we  should  have  to  conclude  that  there  was  no  correlation  between  the 
apparent  fertilities  recorded  for  mares  and  their  dams.  We  are  thus  forced  to  conclude 
that  apparent  fertility  is  a  character  depending  on  tlie  manner  in  which  the  record 
is  formed,  and  must  be  useless  for  the  investigation  of  inheritance.  This  investi- 
gation strengthens  ray  d  prioy^i  reasons  for  selecting  fecundity,  not  apparent  fertility, 
as  the  character  to  be  investigated.  I  took  the  fecundity  of  a  brood-mare  to  be  the 
number  of  her  living  offspring  divided  by  the  potential  number  of  her  offspring 
under  the  given  circumstances.  Of  both  numerator  and  denominator  of  this  ratio 
I  must  say  a  few  words. 

*  Tlie  portion  of  the  table  cut  off  by  vertical  and  horizontal  lines  through  the  means  of  dams  and  mares, 
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In  considering  the  inheritance  of  fertility  I  had  two  different  problems  in  my 
mind  :  (i.)  Is  fertility  pure  and  simple  inherited  ?  i.e.,  Does  a  very  fertile  m.are  have 
offspring  more  fertile  than  the  average  1  And  (ii.)  What  effect  does  reproductive 
selection  actually  have  on  the  population  ?  i.e.,  To  what  extent  is  it  screened  by 
other  factors  of  evolution ;  does  the  very  fertile  mare  actually  have  more  offspring 
than  the  less  fertile  1  Is,  for  example,  her  stock  weedy  and  likely  to  die  early  ?  In 
the  case  of  mankind,  the  fertility  of  a  woman  is,  as  a  rule,  effectively  brought  to 
its  limit  with  the  end  of  her  marriage,  and  accordingly  I  started  with  completed 
marriages.  In  the  case  of  a  brood-mare  her  effective  fertility  depends  not  on  the 
offspring  she  has  but  on  the  number  of  these  which  survive  foaldom.  It  would  doubt- 
Jess  have  been  better  to  have  treated  these  two  problems  of  fertility  separately,  but 
being  fairly  confident  from  Proposition  I.,  p.  260,  that  fertility  must  be  inherited,  I  was 
more  interested  to  test  the  actual  effect  of  reproductive  selection.  Accordingly  I 
selected  as  the  numerator  of  ray  fecundity  ratio,  not  the  number  of  foals  born,  but 
those  who  survived  to  the  yearling  sales.  The  difference  is  not  very  great,  but  quite 
sensible.  For  example,  the  mean  fecundity  of  3909  brood-mares,  measured  in  my 
way,  =  '6343,  i.e.,  63  surviving  offspring  on  the  average  of  100  coverings. 

The  following  table  gives  the  result  of  reckoning  merely  barren  mares  and  those 
slipping  foals  or  giving  birth  to  dead  foals  in  a  twenty -year  period  : — 


Average  Fecundity  of  Brood-mares. 


Year. 

Average  fecundity. 

Year. 

Average  fecundity. 

1873 

•712 

18S3 

•693 

1874 

•703 

1884 

•678 

1875 

•707 

1885 

•702 

1876 

■697 

1886 

•700 

1877 

•692 

1887 

•682 

1878 

•680 

1888 

•695 

1879 

■683 

1889 

•685 

1880 

■066 

1890 

•686 

1881 

•680 

1891 

•679 

1882 

•667 

1892 

•675 

The  averages  of  five-year  periods  are  : 

•702,  -675,  -691,  -684, 

and  of  the  whole  period,  '68 8. 

There  does  not  appear  to  be  sufficient  evidence  for  any  secular  change  here,  and 
we  may  take  '688  to  represent  the  average  fecundity  of  the  brood-mare,  reckoning 
viable  offspring  to  the  number  of  coverings.  The  difference  of  "688  and  "634  gives 
a  death-rate  of  5 '4  foals  in  68 '8,  or  a  death-rate  of  7*85  per  cent,  of  foals  before 
maturity.  If  a  considerable  part  of  this  death-rate  be  differential,  we  have  room 
for  natural  selection  influencing  the  drift  of  reproductive  selection.    The  standard 
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deviation  in  the  fecundity  is,  however,  about  '191,  or  about  19  foals  in  the  G3,  or 
about  30  per  cent. — a,  very  ^reat  variation,  so  that  if  fecundity  be  inherited,  a 
differential  death-rate  of  the  inuiiature  will  hardly  suflice  to  check  it. 

So  much  then  of  the  numerator  of  my  ratio.  I  have  spoken  immediately  above  of 
the  denominator  as  if  it  were  the  number  of  times  the  mare  had  been  covered. 
It  is  generally  this,  but  in  the  relatively  few  cases  where  the  mare  has  given  birth 
to  twins,  I  have  counted  that  covering  twice.  Had  this  not  been  done  the  fecundity 
might  have  been  greater  than  unity,  for  example  even  in  some  exceptional  cases 
have  I'isen  to  two.  On  the  other  hand,  a  loss  of  twins  would  have  been  marked  by 
no  greater  change  in  fecundity  than  a  loss  of  one  foal,  or  the  survival  of  one  twin 
would  not  have  been  different  in  its  effect  on  fecundity  to  the  birth  of  a  foal.  In 
order,  therefore,  to  avoid  these  difhculties — especially  that  of  isolated  individuals 
lying  far  beyond  the  fecundity  range  of  0  to  1 — when  twins  were  born  the  poten- 
tiality of  the  covering  was  reckoned  in  the  denominator  as  two.  The  relative 
infrequency  of  twins  causes,  however,  this  modification  of  the  denominator  to  have 
small  influence  on  the  result. 

My  next  step  was  to  form  some  estimate  of  the  extent  to  which  fecundity  thus 
measured  was  the  same  for  different  periods  in  a  mare's  breeding  career.  I  expected 
fecundity  to  diminish  with  age  as  in  the  case  of  mankind,  but  taking  out  a  fairly 
large  test  number  of  mares,  I  found  that  their  fecundity  for  the  periods  covered  by 
two  successive  stud-books  was  in  the  majority  of  cases  closely  the  same.  With  larger 
experience  I  should  now  lay  more  weight  on  the  decrease  of  fecundity  with  age  ;  and 
I  also  think  fecundity  is  smaller  when  the  mare  first  goes  to  the  stud.  But  even 
thus  much  of  the  reduced  fecuiidity  of  old  mares  seems  to  arise  from  breeders  sending 
famous  mares  to  the  sire  long  after  their  breeding  days  are  passed.  I  have  several 
records  of  old  mares  being  covered  seven  or  eight  times  without  offspring.  This 
custom  of  breeders  was  much  more  rife  in  the  early  days  of  breeding  than  it  appears 
to  be  now,  when  some  breeders  discard  or  sell  a  fairly  old  mare,  even  if  she  is  barren 
two  or  three  successive  years.  Clearly  the  custom  gives  the  mare  a  fictitious 
fecundity,  far  below  her  real  value,  and  probably  accounts  for  granddams  having  a 
somewhat  less  fecundity  than  their  granddaughters. 

The  next  problem  to  be  answered  was  the  effect  the  method  of  forming  my 
fecundity  ratios  might  have  on  the  relative  numbers  which  would  be  found  in 
different  element-groups.  For  example,  supposing  the  element  of  fecundity  to  be 
1/10,  or  the  element-groups  0-1/20,  f/20-8/20,  3/20-5/20, .  .  .  17/20-19/20,  19/20-1, 
would  the  fact  that  the  fecundity  ratio  is  a  ratio  of  ivhole  numbers  cause,  d  priori,  a 
greater  probability  of  frequency  in  one  of  these  element-groups  than  another  1 

To  begin  with,  all  estimation  of  fecundity  based  on  less  than  four  coverings  was 
discarded.  Three  coverings  give  too  rough  an  appreciation  of  a  mare's  fecundity,  it 
can  only  fall  into  one  of  the  values  0,  1/3,  2/3,  and  1.  The  question  then  arises,  if 
all  the  fecundities  : 


MATHEMATICAL   CONTRIBUTIONS  TO  THE  THEORY  OF  EVOLUTION".  295 


0/4,  1/4,  2/4,  3/4,  4/4, 
0/5,  1/5,  2/5,  3/5,  4/5,  5/5, 
0/6,  1/6,  2/6,  3/6,  4/6,  5/6,  6/6, 


0/26,  1/26,  2/26,  ....  25/26,  26/26, 

were  equally  likely,  how  would  the  frequency  depend  on  the  grouping 

Taking  26  coverings  as  the  probable  maximum — it  actually  occurs— we  have  for 
the  total  number  of  fecundities  given  above  :  54-6  +  7-|-...-{-  27  =  368  separate 
fecundities.    Let  us  see  how  they  would  be  divided  in  one  or  two  cases. 

Case  (i.)  Let  the  elements  be  based  on  1/8,  or  be  0-1/16,  1/16-3/16,  3/16-5/16, 
5/16-7/16,  7/16-9/16,  9/16-11/16,  11/16-13/16,  13/16-15/16,  15/16-1. 

The  half-groups  at  the  ends  are  taken  so  that  zero  and  perfect  fecundity  should 
really  be  plotted  at  the  middle  of  a  1/8  element.  We  find,  adding  up  the  numbers 
of  the  above  fecundities  which  fall  into  the  nine  groups,  the  following  frequencies  : — 

33-5,    42,    43-5,    44,    42,    44,    43-5,    42,  33-5. 

There  is  thus  a  somewhat  deficient  frequency  in  the  terminal  groups,  and  this 
would  probably  to  some  extent  bias  tlie  distribution. 
Case  (ii.)  Let  the  elements  be  based  on  1/15,  or  be 

0-1/30,  1/30-3/30,  3/30-5/30,  .  .  .  25/30-27/30,  27/30-29/30,  29/30-1. 

We  liave  the  following  distribution  : 

23,  22,  23,  23-5,  22-5,  23-5,  23'5,  22,  22,  23-5,  23-5,  22-5,  23-5,  23,  22,  23. 

The  bias  here  is  only  slight  and  the  distribution  is  on  the  whole  very  satisfactory. 
Case  (iii.)  Let  the  elements  be  based  on  1/20,  or  be 

0-1/40,  1/40-3/40,  3/40-5/40,  .  .  .  35/40-37/40,  37/40-39/40,  39/40-1. 

We  find  for  the  groups  : 

23,  13,  17-5,  17-5,  17,  18,  17,  18,  17-5,  17,  18,  17,  17*5,  18,  17,  18,  17,  17-5,  17-5,  13,  23. 

Here  the  terminal  groups  have  too  great  a  frequency,  and  the  adjacent  groups  too 
little.  It  is  clear  that  the  division  into  1/15  elements  is  better  than  those  of  1/8  of 
1/20,  so  far  as  these  results  go.  But  unfortunately  the  different  coverings  do  not 
occur  in  anything  like  the  same  proportions.  Their  exact  frequencies  could  only  be 
found  d  posteriori,  and  I  was  desirous  of  having  some  idea  of  grouping  before  start- 

*  Such  problems  are  really  not;  infrequent  in  statistical  investigation.'!,  ard  seem  to  be  of  some 
interest  for  the  theory  of  fractional  numbers.  Mr.  FiLON  worked  out  for  me  the  details  of  the  cases 
given  below. 
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ing  the  labour  of  extraction  I  therefore  weighted  the  different  coverings  on  the 
basis  of  a  small  preliminary  investigation  as  follows  : 

Case  (iv.)  Number  of  coverings,    4  to    5  inclusive,  loaded  with  2. 

6  to    9  ,,  „  3. 

10  to  15  „  „  4. 

16  to  18  „  „  2. 

19  to  26  „  „  1. 

The  resulting  system  of  frequencies  was  : 

54,  42-5,  45-5,  47,  45-5,  46-5,  46,  45,  45,  46,  46-5,  45-5,  47,  45*5,  42  5,  54. 

This  system  is  not  so  uniform  as  in  Case  (ii.),  T  had  hoped  that  the  744  frequencies 
would  have  been  fairly  closely  the  double  system  of  Case  (ii.).  The  main  irregularity 
occurs  at  the  terminal  groups,  or  those  having  fecundities  nearly  zero  and  nearly 
perfect.  These  I  considered  would  be  relatively  infrequent,  when  we  started  with  as 
many  as  four  coverings,  and  had  an  average  failure  of  about  37  in  100.  The  sequel 
showed  that  the  assumption  was  legitimate,  so  far  as  regards  zero  fecundity,  but  that 
perfect  fecundity  was  sufficiently  frequent  to  cause  a  hump  in  the  frequency  curve 
for  fecundity,  corresponding  to  the  group-element  29/30  to  1.  The  frequency  of 
this  group  is  greater  than  that  of  the  group  27/30  to  29/30,  when  we  start  from  at 
least  four  coverings.  This  hump  entirely  disappears,  however,  if  we  start  with  at 
least  eight  coverings.  Thus  I  take  the  hump  to  be  purely  '"' spurious,"  f.^'.,  a  result  of 
the  arithmetical  processes  employed,  and  not  an  organic  character  in  fecundity.  It 
depends  upon  our  definition  of  fecundity,,  which  is  not  a  truly  continuous  quantity. 

As  the  theory  of  correlation  applied  is  not  in  any  way  dependent  on  the  form  of 
the  correlation  surface,  beyond  the  assumption  of  nearly  linear  regression,  tlie  hump 
cannot,  I  think,  sensibly  affect  our  conclusions.  Had  I  known,  however,  a  ])riori, 
what  the  frequency  of  different  coverings  and  the  nature  of  the  fecundity  frequency 
curve  would  be,  1  should  have  attempted  to  choose  such  a  group-element,  that,  with 
proper  weighting  of  the  coverings,  there  would  have  been  no  arithmetical  bias  to 
the  terminal  groups.  As  it  was,  it  seemed  to  me  that  the  group-element  of  1/15 
gave  fairly  little  arithmetical  bias — at  any  rate  where  the  bulk  of  the  frequency  would 
occur — and  it  was  accordingly  adopted  as  a  basis  for  classifying  fecundities. 

The  dfficulty  illustrates  tlie  point  I  have  referred  to,  namely,  that  in  statistical 
investigations  the  best  classification  can  only  be  found  d  posteriori,  but  the  classifi- 
cation adopted  has  usually  to  be  selected  d  priori. 

The  1/15  element  being  selected,  the  letters  a,  h,  c,  d,  e,  f,  g,  k,  i,j,  k,  I,  m,  n,  j^,  q 
were  given  to  the  16  gi-oups  of  fecundities  from  0  to  1,  as  cited  under  Case  (ii.).* 

*  A  table  was  formed  of  the  368  actually-occurring  fecundities,  from  which  it  was  possible  to  at  once 
read  off  the  group  (or  it  might  be  two  groups,  e.g.,  "5  falls  half  into  7t  and  half  into  ?')  into  which  they 
each  fell. 
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Thus  the  fecundity  of  a  mare  was  described  by  one  of  these  16  letters.  Here  the 
centre  of  the  j  group,  for  example,  is  '6,  and  it  covers  all  fecundities  from  '56  to  -63. 
Thus  midway  between  j  and  h  we  are  at  about  the  mean  fecundity. 

The  more  recent  Stud-Books,  vols.  12  to  17,  were  taken  as  containing  more 
complete  details  and,  what  is  more  important,  less  in-and-in  breeding,  although  as  we 
shall  see,  this  is  still  an  important  factor.  These  volumes  cover  30  and  more  years 
of  English*  stud  life.  From  these  30  years'  records  upwards  of  5000  mares,  who  had 
been  covered  upwards  of  four  times,  had  their  fecundity  ascertained.  The  process 
was  a  very  laborious  one,  as  each  mare  had  generally  to  be  sought  for  in  several 
volumes,  and  the  records  in  each  volume  are  not  continuous,  but  overlap  by  quite 
arbitrary  numbers  of  years.  Further,  great  care  had  to  be  taken  to  identify  each 
mare  properly,  as  the  same  name  is  very  frequently  repeated,  and  the  like  difficulty 
occurs,  though  to  a  lesser  extent,  in  the  case  of  sires.  A  card  was  then  written, 
giving  the  name  of  the  mare  and  those  of  her  sire,  her  dam,  and  her  dam's  sire. 
Upon  this  card  the  letter  indicating  her  fecundity  was  placed.  A  card  alphabet  of 
mares  was  thus  formed,  consisting,  in  the  first  place,  of  about  3000  entries.  This 
alphabet  was  again  gone  through  and  the  fecundity  of  the  dams  of  the  mares  inserted 
on  the  cards  till  there  were  about  2500  cases  know^n  of  mare  and  dam.  The  dams 
were  partly  found  from  the  existing  series,  but  it  was  also  largely  necessary  to  work 
out  fresh  cases.  Lastly,  the  cards  were  gone  through  and  the  fecundity  of  the  grand- 
dams  entered  in  upwards  of  1000  cases.    This  forms  the  first  series  of  cards. 

In  the  next  place  a  card  index  was  formed  of  all  the  sires  serving  during  these 
30  years.  This  contained  upwards  of  1000  cards.  On  these  cards  the  sire's  sire  was 
entered,  and  the  fecundity  of  all  the  mares  contained  in  the  first  or  mare  alphabet 
was  now  taken  off  and  placed  on  the  card  of  the  mare's  sire.  Thus  the  card  of  each 
sire  had  the  letters  a,  h,  c,  d,  e,  &c.,  upon  it,  and  a  frequency  distribution  was  formed 
on  the  card  of  each  sire  for  the  fecundity  of  his  daughters. 

The  same  thing  was  done  for  the  sires'  sires  ;  only  here  recourse  had  again  to  be 
had  to  the  stud-books  to  obtain  the  fecundity  of  the  daughters  of  the  more  ancient 
sires.  Finally,  a  sire-alphabet  was  obtained  which  gave  the  average  fertility  of  the 
daughters  of  a  sire  and  of  the  daughters  of  his  sire,  or  his  half-sisters.  On  these 
cards  was  also  placed  the  number  of  mares  upon  which  each  average  was  based. 

These  two  card-alphabets,  the  mare  and  sire  alphabets,  form  the  "  dressed " 
material  upon  which  all  the  subsequent  calculations  were  based. 

(14.)  At  this  point  it  seems  desirable  to  insist  somewhat  on  the  many  causes  which 
tend  to  make  the  fecundity  of  mares,  as  thus  determined,  to  a  considerable  extent 
fictitious.  Many  of  these  were  only  apparent  to  me  as  I  became  more  and  more 
familiar  wdth  the  material. 

*  Irisli  mares  were  excluded  except  where,  for  pedigree  purposes,  it  was  necessary  to  deal  with  them. 
Many  Irish  mares  were  further  included  when  it  came  to  the  valuation  of  the  fertility  of  mares  due  to 
a  given  sire. 

VOL.  CXCII. — A.  'I  q 
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(a.)  Mares  appear  to  be  less  fecund  at  the  beginning  and  end  of  their  breeding 
career.  Hence,  when  the  fecundity  is  based  on  a  part  only  of  their  career,  as  it  often 
must  be,  we  do  not  really  get  a  fair  appreciation. 

(h.)  A  more  fertile  mare  is  likely  to  have  more  daughters  go  to  the  stud  than  a  less 
fertile  one,  and  hence  we  get  a  better  appreciation  of  the  fertility  of  the  offspring  of 
the  former  than  of  the  latter. 

(c.)  Fashion  among  breeders  interferes  largely  with  the  exhibition  of  the  natural 
fecundity  of  a  mare.  She  may  be  a  famous  mare  and  is  sent  to  a  famous  sire,  even 
though  produce  is  not  so  likely  as  if  she  were  put  to  a  sire  of  a  different  class.  This 
appears  to  be  practically  recognised  when  apparently  barren  mares  are  sent  in  one 
season  to  two,  or  even  three  sires,  or  again  to  half-bred  horses  or  cart-horses. 

(d.)  Brood-mares  which  have  produced  performers  are  kept  much  longer  at  the 
stud,  and  we  have  the  fecundity  lowered  by  coverings  after  the  mare  is  sensibly 
sterile.    Less  important  mares  are  removed  sooner  from  the  stud. 

(e.)  Good  racing  mares  are  often  put  late  to  the  stud. 

{/.)  In  a  certain  number  of  cases  we  are  simply  told  that  the  mare  had  no  produce 
for  a  period  of  years,  but  whether  she  was  covered  or  not  is  unrecorded. 

(g.)  Second-rate  mares,  or  mares  thought  to  be  near  the  end  of  their  fecundity,  are 
often  sold  abroad.  In  the  latter  case  the  fecundity  is  fictitiously  increased;  in  the 
former  we  have  only  a  short  period  to  base  it  on. 

(A.)  There  is  no  record  kept  of  the  half-bred  foals,  which  for  our  purpose  are  as 
important  as  the  thoroughbred  foals.  "Put  to  a  hunter"  is  a  not  uncommon  record, 
with  no  statement  of  the  result. 

(i.)  Comparatively  infertile  mares,  unless  of  very  valuable  stock  or  famous  as  racers, 
are  not  kept  long  enough  at  the  stud  to  get  a  reliable  measure  of  their  fecundity. 

(/.)  The  smaller  breeders  will  often  put  mares  to  inferior  sires,  already  nearly 
worn  out,  either  because  they  own  them,  or  because  their  fee  is  low  ;  and  thus  again 
a  full  chance  is  not  given  to  the  fecundity  of  the  mare  to  exhibit  itself. 

[k.)  We  have  excluded  in  our  determination  of  the  fecundity  foals  dying  young. 
This  is  often  due  to  the  fault  of  the  mare,  but  is  often  again  due  to  the  environment. 

(/,)  Lastly,  thoroughbred  mares  are  highly  artificial  creatures,  and  many  must 
suffer  from  their  environment,'"  either  in  the  matter  of  barrenness  or  slipping  foal,  in 
a  manner  from  which  the  wild  horse  or  a  more  robust  domesticated  animal  would  be 
entirely  free. 

These  considerations  may  suffice  to  show  that  our  values  of  the  fecundity  will  only 
roughly  represent  what  may  be  termed  the  natural  fecundity,  and  we  ought  not  for 

*  I  am  told  that  there  are  like  difficulties  with  cows.  Cow.s  are  very  liable  to  slip  their  calves,  and 
one  cow  doing  so,  several  others  in  the  herd  will  or  may  follow  her  example.  There  is  a  strong  folk- 
belief  in  Wiltshire — I  give  it  merely  as  evidence  of  what  a  slight  change  in  the  environment  is  supposed 
to  achieve — that  the  habitual  presence  of  a  donkey  with  the  herd  in  some  way  soothes  the  cows,  and 
renders  them  less  ready  to  slip  their  calves. 
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a  moment  to  expect  inheritance  in  the  full  intensity  of  the  Galtonian  law  to  be 
exhibited  by  such  material. 

(15.)  But  there  is  another  point  of  very  considerable  importance  for  the  weakening 
of  correlation,  namely  the  effect  of  in-and-in  breeding.  To  get  correlation  we  must 
have  a  diversity  of  parents  producing  a  diversity  of  oflPspring,  but  when  the  parents 
become  more  and  more  identical,  we  get  larger  and  larger  arrays  between  which  and 
the  parents  the  correlation  is  weakened.  For  example  :  suppose  the  correlation  found 
between  all  parents  and  offspring  in  the  general  population,  and  now  select  only  all 
the  brothers  in  a  large  array  and  find  the  correlation  between  them  and  their 
offspring,  we  shall  find  that  the  correlation  is  lower  than  in  the  previous  case.""'  It 
would  be  impossible  to  apply  theory  to  the  present  case,  however,  because  we  can 
only  roughly  appreciate  the  extent  of  such  in-and-in  breeding.  That  it  is  great  the 
following  statistics  will  show. 

Of  the  more  than  1000  sires  in  my  sire  alphabet,  only  760  were  sires  of  mares 
which  had  been  covered  at  least  four  times.  These  760  sires  had  upwards  of 
5000  offspring,  of  whom  I  had  the  fecundity  recorded,  but  when  mares  with 
alternative  sires  were  excluded,  there  remained  only  4677  available  mares.t  These 
mares  were  distributed  as  follows  : — 


Daughtei'S  .... 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

280 

113 

78 

43 

29 

22 

20 

21 

22 

14 

10 

10 

Danghters  .... 

13 

14 

15 

16 

17 

18 

19 

20 

Above  20 

11 

11 

8 

6 

2 

6 

4 

4 

46 

Here  the  second  line  gives  the  number  of  sires  having  the  number  of  daughters 
in  the  first  line  in  the  4677  cases,  which  I  take  to  be  a  fair  sample. 

Thus  over  a  third  of  the  sires  had  only  one  mare.  Two-thirds  of  the  sires  had 
together  only  one-fifth  of  the  mares.  Seventy-six  of  the  sires  were  fathers  of 
about  half  the  mares,  and  46  sires  alone  produced  1801  mares,  almost  as  many  as 
642  sires  did.  We  are  here  dealing  with  the  fairly  long  period  of  30  years,  but 
even  making  due  allowance  for  young  stallions  commencing  and  old  stallions  con- 
cluding their  stud  career,  it  will  be  manifest  that  our  sample  shows  that  the  great 
bulk  of  mares  for  the  period  in  question  were  the  offspring  of  comparatively  few  sires. 

But  let  us  look  at  the  problem  from  the  standpoint  of  the  sires.    My  760  sires 

*  The  theory  of  such  cases  is  fully  developed  in  a  memoir  on  the  influence  uf  selection  on  correlation 
not  yet  publiislied. 

t  Some  other  cases  were  also  excluded  for  diverse  reasons. 

2  Q  2 
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were  all  fathered  among  themselves  except  in  49  cases.  In  other  words,  they  were 
the  sons  or  grandsons  of  only  49  sires.  Of  these  49  sires,  there  were  12  whose 
pedigree  I  could  not  trace,*  but  they  were  very  probably  sons  of  sires  already  on  my 
list  or  among-  the  remaining  37.  In  the  majority  of  oases  they  appeared  only  as 
the  sire  of  one  stallion.  The  remaining  37,  whose  pedigree  I  could  trace,  were 
descended  at  once  or  in  very  few  generations  from  9  sires.t  Thus  both  from  the 
standpoint  of  the  mares  and  of  the  sires  we  are  dealing  with  a  closely  in-bred 
stock,  and  this  is  one  and  probably  a  very  important  factor  in  the  weakening  of  the 
fecundity  correlation. 

Having  regard  to  these  difficulties,  if  we  can  succeed  in  showing  that  fecundity  in 
thoroughbred  racehorses  is  inherited,  we  can  be  fairly  confident  that  we  have 
only  reached  a  lower  limit  of  the  correlation  coefficient. 

(16.)  On  the  Inheritance  of  Fecundity  in  the  Female  Line. 

(i.)  A  preliminary  investigation  must  here  be  made,  in  order  to  determine  the  p  of 
the  formulae  given  in  Proposition  III.  (p.  269)  we  want  the  correlation  of  fecundity 
with  fertility.  If  ^  be  the  fecundity,  /  the  apparent  fertility,  and  c  the  number  of 
coverings,  twins  counting  as  a  double  covering,  we  have  : 

^  =  /A-,  ■ 

whence  if  we  determine  the  correlation  between  ^  and  f,  numerous  constants  will 
folloAv.  Table  XI.  gives  the  correlation  between  fertility  and  fecundity  for  1000 
brood-mares.    We  found  : 

=  -6375,  M^-  =  6-515, 

cr^  =  -1810,  o-j  =  3-2775, 
p  =  r^j  =  -5152, 

=  lOOo-^/M^  =  28-39,  Vj-  =  100cr,/M,-=  50'31, 

where  and  Vj-  are  the  "  coefficients  of  variation."!  Here  by  Yule's  Theorem  § 
r^fO-Ja-j-  is  the  slope  of  the  line  which  most  closely  fits  the  curve  of  regression  for 
fecundity  on  fertility.  If  we  supposed  this  curve  to  be  straight,  then  the  line  must 
coincide  with  it.  Now  since  fecundity  vanishes  with  fertility,  the  curve  passes 
through  the  origin,  and  hence,  if  the  regression  be  linear,  the  line  must  also  pass 
through  the  origin.  In  this  case,  as  is  shown  on  p.  270,  r^=  vjv  =  -5644.  The 
difl^erence  between  '5644  and  '5152  may  be  taken,  as  it  is  several  times  the  probable 
error,  to  indicate  that  the  regression  curve  between  fecundity  and  fertility  is  only 
approximately  linear. 

The  variations  in  both  fertility  and  fecundity  are  here  large.     Accordingly  we 

*  Stockmar,  Sovereign,  Andover,  Phaeton,  Prince  Caradoc,  Robert  Houdin,  Pylades,  King  of  Kent, 
Garry  Owen,  Calaban,  Homily  and  Taurus. 

t  Tramp,  Sir  Peter  Teazle,  Catton,  Buzzard,  Orville,  Diomed,  Sorcerer,  Dr.  Syntax,  Marske. 

+  '  Phil.  Trans.,'  A,  vol.  187,  p.  276. 

§  '  Roy.  Soc.  Proc,,'  vol.  60,  p.  477. 
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must  use  the  formula  (i.)  for  the  mean  vahie  of  an  index  given  in  my  memoir  on 
spurious  correlation,"^'  We  shall  then  obtain  an  approximate  value  to  the  mean 
nvnnber  of  coverings  of  each  mare.  Formulse  (iii.)  of  the  same  paper  will  then  give 
the  standard  deviation  for  the  number  of  coverings.    In  our  present  notation  : 

and  therefore  : 


We  find  : 

M,/M^  =  10-2196, 

and : 

M,  =  10-2196  X  1-007  =  10-2911, 
=  4-4455, 
V,  =  43-20. 

To  the  same  degree  of  approximation  we  can  further  ascertain  the  correlations 
between  the  number  of  coverings  and  the  apparent  fertility  and  fecundit}^,  i.e.,  r^j, 
and  A  short  investigation  similar  to  those  in  the  memoir  on  spurious  correlation 
just  cited  shows  us  that : 

^V/  =  (^/  - 

These  lead  to  the  numerical  results  : 

r,,  =  -8259,  r,^  =  -  '0572. 

The  conclusions  to  be  drawn  from  these  results  are  all  of  some  interest.  In  the  first 
place  we  may  ask  :  How  does  agree  with  its  value  found  from  other  and  more 
complete  series?  For  4677  mares — my  complete  series  without  mares  with  alternative 
sires — the  average  fecundity  was  '6373.  A  better  agreement  could  not  have  been 
hoped  for.  In  a  group  of  1509  mares  dealt  with  for  variation  only  and  entered  as 
"  daughters  "  on  the  cards — so  that  they  had  not  been  selected  by  the  fact  that  their 
daughters  must  have  recorded  offspring,  as  is  the  case  with  "  dam  "  entries — I  found 
the  following  results  : — 


Variation  in  Fecundity  of  1509  Brood-mares  (Four  Coverings). 


Fecundity. 

a. 

h. 

c. 

cl 

e. 

/• 

9- 

h. 

t. 

j- 

k. 

I. 

m. 

11. 

P- 

1- 

Frequency 

9 

3 

11 

26 

46 

43-5 

85 

1225 

154-5 

23:2-5 

194 

223 

146 

100 

23 

90 

Total  1509. 

M^  =  -6345. 

=  -1965. 

*  '  Roy.  Soc.  Proc.,'  vol.  60,  p.  492. 
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Now  this  is  precisely  what  we  might  expect ;  the  mares  belonged  to  a  class,  of 
which  we  are  not  certain  whether  their  daughters  have  or  have  not  recorded  fecundity. 
The  mean  fecundity  is  therefore  decreased  and  the  variability  increased.  Add  to  this 
group  2400  mares,  all  of  which  had  had  their  daughters'  fertility  recorded,  and  we 
find  for  3909  mares,  =  -6345  and  =  '1910,  i.e.,  the  mean  fecundity  ascends 
and  the  variability  falls.  Illustration  of  this  law  will  be  found  in  the  following  two 
groups  : — 


•6337 

•1888 

1200  dams  

■6525 

•1643 

Thus  we  send  up  the  mean  fertility  and  lower  the  variability  by  separating  into  two 
groups  the  pedigree  of  one  which  has  a  longer  record.  This  is  precisely  in  accordance 
with  the  theory  already  developed.  Our  mean  fecundity  and  variability  for  brood- 
mares may  be  considered  as  constant  characters,  and  variations  in  their  values  beyond 
their  probable  errors  due  to  conscious  or  unconscious  selection  in  the  record  itself, 
or  in  our  extracting  from  it. 

The  reader  will  notice  at  once,  if  he  turns  to  the  diagram  of  the  above  frequency, 
(i.)  that  there  is  a  small  hump  at  (a)  of  no  practical  importance,  and  a  larger  one 
at  {(j),  perfect  fertility  being  fairly  frequent  with  only  four  coverings,  and  there  being 
from  the  arithmetical  processes  involved  a  bias  towards  (q)  as  compared  with  (p). 
(ii.)  The  distribution  of  frequency,  although  somewhat  ragged,  is  quite  clearly  not 
normal,  but  of  the  character  which  in  other  papers  I  have  called  skeiv.  Were  there 
any  occasion,  it  would  be  easy  to  fit  it  with  one  of  my  skew  curves.  To  mark 
how  (i.)  will  disappear  and  (ii.)  become  still  more  apparent,  I  have  placed  on  the 
diagram  the  frequency  distribution  for  2000  mares  reduced  to  the  same  scale. 

260,  1  1  1  — — ,  1  1—  ,  . — ■  1  ,  ,  1  ■  1  1  , 
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Variation  in  Fecundity  of  2000  Brood-mares  (Eight  Coverings). 


Fecundity 

a. 

b. 

c. 

d. 

e. 

/■ 

'J- 

h. 

i. 

j- 

h. 

I. 

VI. 

w. 

V- 

9- 

Frequency 

0 

2 

7-5 

11-5 

21-5 

55 

104-5 

182 

271-5 

815 

337 

293-5 

204 

127 

49 

19 

Total,  2000. 

M^  = 

=  -6330.. 

•1568. 

Thus,  making  the  minimum  number  of  coverings  8  instead  of  4,  has  removed 
the  terminal  humps,  zero  fecundity  is  now  unknown,  and  perfect  fecundity  very  rare. 
We  have  reached  a  smooth  skew  frequency  distribution  ;  we  see  fecundity  as  a  con- 
tinuous character  obeying  the  usual  laws  of  variation.*  The  mean  fecundity  in  the 
two  cases  is  sensibly  the  same,  '633,  but  owing  to  the  fact  that  we  have  made  a 
selection  of  a  limited  group  in  the  second  case,  the  variability  is  considerably 
decreased. 

The  average  apparent  fertility  of  brood-mares,  6 "5 15,  must  not  be  confused  with 
their  average  real  fertility,  for,  as  we  have  seen,  we  have  in  many  cases  not  a  com- 
plete record  of  their  stud-life,  or  such  a  full  record  has  not  been  used  {e.g.,  in  case  of 
mares  still  at  the  stud,  but  having  been  already  covered  four  or  more  times).  Its  50 
per  cent,  variation  shows  that  an  apparent  fertility  of  9  to  12  is  not  infrequent. 
The  average  number  of  coverings  being  10  and  more,  it  will  be  seen  that  the  records 
of  between  50,000  and  60,000  coverings  have  been  dealt  with  to  form  our  mare  and 
sire  alphabets.  The  large  variability  in  the  number  of  coverings  shows  that  15 
to  20  coverings  will  not  be  infrequent,  and  cases  of  26  actually  occurred.  Lastly, 
we  have  the  correlation  between  fertility  and  the  number  of  coverings,  high  as 
might  be  supposed,  for  a  high  apparent  fertility  could  only  be  exhibited  by  many 
coverings.  Although  a  low  apparent  fertility  might  correspond  to  any  number  of 
coverings,  still,  in  practice  a  sterile  mare  will  not  be  sent  indefinitely  to  the  sire.  The 
correlation  between  the  number  of  coverings  and  the  fecundity  is  small  and  negative 
(—  '0572).  This  follows  from  the  principle  that,  fertility  being  the  same,  a  high 
number  of  coverings  reduces  the  fecundity,  and  this  factor  is  more  potent  than  the 
high  correlation  of  fertility  and  the  number  of  coverings. 

(ii.)  Table  XII.  exhibits  the  correlation  of  1200  mares  and  their  dams  with  regard 
to  fecundity.  Here  the  more  fertile  dams  are  weighted  with  their  fertility,  and  at 
least  four  coverings  were  required  of  each  mare.  If  the  subscript  w  refers  to  mare, 
and  d  to  dam,  we  find  : 

*  The  actual  equation  to  tlie  curve  referred  to  the  mode  '6531  as  oi\igin,  the  axis  of  x  being'  positI\'c 
towai'ds  perfect  fecundity,  and  the  unit  of  x  being  1/15  is : 

y  =  .342-187  (1  +  ,r/47-1358)8''«i  (1  -  .r/12-1106)2i-229l. 
The  fit        be  found  to  be  very  satisfactory, 
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M„.  =  '6337,  Mrf  =  -6525, 

o-„='1888,  a-,  =  '1643, 

r„,=  -0831. 

The  coefficient  of  regression  =  '0945. 

The  probable  error  of  the  correlation  is  '0193  and  of  the  regression"'*  *0195.  Thus 
these  quantities  are  four  to  five  times  their  probable  errors,  and  we  conclude  that 
fecundity  is  certainly  inherited. 

The  intensity  is  far  below  that  suggested  by  the  law  of  ancestral  heredity,  but  it 
nevertheless  exists.  Its  lowness  is  probably  due  to  the  fictitious  character  of  the 
fecundity  owing  to  the  causes  indicated  on  pp,  298-9.  An  attempt  must  now  be  made 
to  eliminate  some  of  the  factors  disguising  the  fecundity,  but  to  do  so  is  by  no  means 
so  easy  as  in  the  case  of  fertility  in  man. 

(iii.)  My  first  idea  was  that  by  taking  a  higher  limit  to  the  number  of  coverings  a 
closer  approach  might  be  obtained  to  the  true,  i.e.,  the  inherited  fecundity. 
Accordingly  Table  XIII.  was  formed  for  the  correlation  of  1000  mares  and  their 
dams,  when  the  minimum  number  of  coverings  was  eight.  But  I  did  not  recognise 
that  this  would  give  far  greater  weight  in  the  Table  to  the  older  mares,  and  that 
accordingly  causes  (d)  and  (i)  of  p.  298  would  now  play  a  much  larger  part  in  disguising 
the  true  fecundity  than  before.  There  appears  to  be  no  limit  to  the  number  of 
times  a  famous  old  mare  may  go  to  the  stallion  when  there  is  very  small  liope  of  any 
offspring. 

Table  XIII.  gives  us  the  following  results  : 

•6360, 
•1500, 

The  coefficient  of  regression  =  '0708. 

The  probable  error  of  the  correlation  is  ^0212,  and  of  the  regression  "0213,  both 
less  than  a  third  of  the  observed  values.  We  should  agfain  conclude  from  this  result 
that  fecundity  is  inherited,  although  it  offers  less  strong  evidence  than  the  previous 
case.  The  influence  of  selectiont  is  at  once  apparent  in  the  great  reduction  of  the 
variabilities.  The  fact  that  we  are  throwing  the  determination  of  fecundity  more 
on  to  the  old  age  period  of  life  appears  from  the  reduced  mean  fecundities.  I 
attribute  the  reduction  in  the  fecundity-correlation  to  this  source,  i.e.,  the  very  diverse 
treatment  which  old  mares  receive  at  the  hands  of  different  breeders. 

(iv.)  I  made  another  attempt  to  remove  screening  causes  by  taking  1200  more 

*  Pearson  and  Pilon  :  '  Phil.  Trans.,'  A,  vol.  191,  p.  214. 

t  The  effect  of  such  a  selection  as  the  above  in  reducing  correlation  is  dealt  with  in  my  paper  on  the 
influence  of  selection  on  correlation. 


M„  =  -6300,  M,  = 

a-„i  =  •less,  cr,,  = 

=  -0652. 


MATHEMATICAL  COlsTRIBUTION'S  TO  THE  THEORY  OF  EVOLUTION". 


305 


mares,  not  identical  with  the  series  in  (ii.)'"  and  working  out  their  dams'  records  most 
carefully,  rejecting  any  cases  in  which  the  breeder  was  clearly  sending  the  mare  to 
the  stallion  long  after  it  was  obvious  {post  facto)  that  she  was  sterile.  In  this  case 
four  coverings  were  retained  as  a  minimum,  and  the  results  are  given  in  Table  XIV. 
We  find  : 

M„,  =  -6369,  Ma  =  -6616, 

a,,  =  -1885,  o-a  =  -1604, 

r,„,=  -0995. 

The  coefficient  of  regression  =  *1I69. 

The  probable  error  of  the  correlation  is  -0193,  and  of  the  regression  -0194  ;  the 
correlation  is  accordingly  more  than  five,  and  the  regression  more  than  six  times  its 
probable  error.  We  conclude  that  fecundity  is  most  certainly  inherited.  The 
regression  found  is,  however,  only  about  two-fifths  of  what  is  required  by  the  law  of 
ancestral  heredity, 

(v.)  It  has  been  suggested  that  fertility  or  fecundity  might  alternate  in  ttvo 
generations  ;  when  the  offspring  are  numerous  their  offspring  might  have  less  fertile 
or  fecund  offspring.  I  do  not  see  how  this  would  be  possible  without  its  exercising 
an  influence  on  the  correlation  of  two  generations,  for  we  must  come  to  one  fertile 
followed  by  an  infertile  generation.  But  I  had  made  preparations  in  my  alphabet  of 
mares  for  testing  the  correlation  between  mares  and  their  granddams,  and  I  went  on 
to  the  construction  of  a  table,  although  the  results  for  mares  and  their  dams  showed 
me  that  whatever  result  might  be  reached,  it  would  be  within  the  probable  error  of 
the  observations.  I  reached  this  conclusion  in  the  following  manner  :  If  we  go  back 
one  generation  we  introduce,  owing  to  the  nature  of  the  record,  so  much  fictitious 
correlation  and  so  much  in-and-in  breeding  that  the  coefficient  of  inheritance  is 
reduced  to  two-fifths  or  less  of  what  its  value  should  be  according  to  the  law  of 
ancestral  heredity.  In  going  back  two  generations  we  come  to  fewer  mares,  to  more 
in-and-in  breeding,  and  to  just  the  type  of  famous  old  mare,  whose  breeder  kept  her 
at  the  stud  long  after  she  was  sterile.  I  expected  accordingly  a  great  and  artificial 
fall  in  the  fecundity  of  granddams  and  a  double  drop,  something  like  f  X  f ,  in  the 
value  of  the  regression  as  indicated  by  the  law  of  ancestral  heredity.  This  would 
reduce  the  apparent  regression  to  about  |  X  f  of  -15,  or  to  about  '025,  say,  a  value 
about  equal  to  the  probable  error  of  the  table.  The  results  actually  reached  are  given 
in  Table  XV.,  and  we  find,  if  the  subscript  (/  refer  to  granddam  : 

M„  =  -6345,  =  -0232, 

a-,,  =  -2040,  =  -1687, 

r,„^  =  -0169. 

The  coefficient  of  regression  =  '0204. 


*  In  the  first  series  the  mares'  names  run  from  A  to  G ;  in  the  second  from  G  to  M,  with  300 
additions  made  to  the  A  to  G  series,  while  I  was  completing  my  alphaljet, 
VOL.  CXCII. — A.  2  R 
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The  probable  error  of  the  correlation  is  '0213,  and  of  the  regression  •0213.  Thus 
these  results  are  not  significant  in  themselves,  but  they  are  exactly  what  we  might 
expect  on  the  above  hypothesis.  Taken  with  the  other  five  tables  which  we  have 
worked  out  for  the  inheritance  of  fecundity,  they  are  significant,  for  every  one  of 
them  gives  a  positive  correlation,  however  small  it  be,  and  thus  adds  to  the  accu- 
mulated evidence  that  fecundity  is  a  heritable  character. 

(vi.)  It  remains  to  test  our  results  by  the  theory  developed  onpp,  269  et.  seq.  But 
a  diJEficulty  comes  in  here.  Turning  to  (xviii.)  and  (xix.)  on  p.  268,  we  cannot  feel 
justified  in  putting  Mi  =  M.,,  for  there  is  a  secular  difference  in  the  fecundity  of  mares 
and  dams,  owing  to  the  fecundity  of  the  older  brood-mares  being  based  on  a  longer 
period  and  liable  to  the  disturbing  causes  so  markedly  manifest  in  the  correlation  of 
mares  and  granddams  (see  my  remarks,  p.  305).  If  we  combine  (xviii.)  and  (xix.) 
we  find 

~  W,  =  r  -  M.\). 

Now  r  is  small,  and  it  will  accordingly  be  legitimate  to  put  M'l  =  M'g  and  o-j  =  ctj 
on  the  right,  we  have  then 

(M"2-rM"0/(l-r)  =  M'2. 
From  this  we  deduce  for  the  results  in  (ii.)  on  p.  304 

M'o=-6321. 
Turning  now  to  (xix.),  it  may  be  written 

The  second  term  in  the  curled  brackets  is  small,  and  in  it  we  may  put  to  a  first 
approximation  cr'i  =  cti  =  ctj  and  Mi  =  M.\.    We  then  have 

c    .  /-isssv  /  1  \"! 

^  ^  =      ] '  +  (-63^)  L  ,  /-isssy    1    ]  ■ ' 

\       V6321/  (-5132)7, 

or, 

M"i  =  M'l  X  1-0666. 

Substituting  the  value  of  M",  we  find 

M'l  =  -6118. 

We  thus  see  a  diff'erence  in  the  fecundities  of  the  unweighted  dams  and  unweighted 
mares  of  '6118  and  '6321,  or  about  2  foals  more  in  the  hundred  appear  to  survive  in 
the  later  generation.  This  is  very  probably  due  to  the  causes  already  indicated  as 
affecting  the  apparent  fecundity  of  the  older  mares  (see  p.  298).    The  influence  of 
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reproductive  selection  changes  these  quantities  to  "6337  in  the  case  of  the  daughters, 
and  to  the  apparent  high  fecundity  of  -6525  in  the  case  of  the  dams. 

We  can  now  find  o-.>  to  a  second  approximation  by  aid  of  (xxi.).  In  the  small  term 
multiplied  by  r,  we  put  cti  =      =  a"o.    Hence  we  find 

o-?=  o-l  +  r^(cr'1-o-"r), 

and  deduce,  on  substituting  the  numerical  values, 

0-,  =  -1896, 

or  is  scarcely  different  from  o-",.  We  accordingly  conclude  that  we  may  quite 
reasonably  assume  the  variability  of  the  mares  to  represent  the  variability  of  the 
mares  without  reproductive  selection,  but  the  effect  of  weighting  the  dams  with 
their  fertility  is  to  reduce  the  variability  of  the  dams  from  about  -1896,  if  there  be 
no  secular  change,  to  an  apparent  value  as  low  as  '1643. 

The  same  formulae  applied  to  the  slightly  better  results  in  (iv.)  on  p.  305  give  us  : 

M'l  =  -6205,  and  M',  =  -6342. 

If  we  pass  back  from  M'j  and  M'o  to  Mj  and  M.,  we  find : 


First  case. 

Seoond  case. 

M,     .       .  . 
Ma     .  . 

•54G0 
•6266 

•5567 
•6278 

If  these  results  be  considered  as  valid,  we  notice  a  remarkable  difference  between 
the  fecundity  of  the  younger  and  elder  generation.  While  the  crude  results  on 
pp.  304  and  305  might  lead  us  on  first  examination  to  suppose  the  elder  generation 
more  fecund  than  the  younger,  these  results  show  us  that  it  is  distinctly  less  so. 
The  greater  part  of  the  difference,  however,  is  due,  not  to  a  secular  change,  but  to 
the  causes  we  have  so  often  referred  to  as  weakening  the  fecundity  recorded  for  the 
older  mares.  At  the  same  time  the  whole  system  of  breeding  is  so  artificial  that  we 
may  well  doubt  whether  our  equations  (i.)  and  (v.)  can  be  legitimately  applied.  For 
the  chance  of  a  mare  getting  into  the  stud-book  as  a  dam,  i.e.,  having  daughters  at 
the  stud,  depends  less  on  her  fertility  than  on  the  degree  of  fashion  in  her  stock. 
Thus  the  record  weighting  with  fertility  is  hardly  a  probable  hypothesis,  and  the 
Values  just  given  for  Mj  are,  I  suspect,  much  below  what  they  should  be.  For  the 
above  reason  1  have  not  proceeded  to  consider  the  changes  in  variability  connoted  by 
(ii.)  and  (xxii.).  As  I  have  made  no  attempt  to  form  a  correlation  table  for  mares 
and  dams  in  which  the  dam  would  have  only  one  daughter  to  her  record,  I  cannot 
make  any  plausible  guess  at  the  real  magnitude  of  the  cubic  summation  term  in 

2  B  2 
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(xxii.).  Apart,  however,  from  the  numerical  application  of  these  variation  formulae 
to  a  somewhat  doubtful  case,  we  see  in  these  formulse  the  theoretical  basis  for  the 
observed  fact  that  the  fecundity  of  mothers  is  far  less  variable  than  that  of  daughters. 
It  is  really  only  an  apparent  divergence,  due  to  the  fact  that  the  mothers  have  been 
weighted  with  their  fertility  ;  this,  while  it  increases  the  apparent  mean  of  their 
fecundity,  reduces  its  apparent  variability. 

(17.)  Oil  the  Inheritance  of  Fecundity  in  the  Brood-mare  through  the  Male  Line. 

For  the  thoroughbred  horse  this  problem  is  fairly  easily  answered  by  investigating 
whether  mares  related  to  the  same  stallion  have  any  correlation  between  their 
fecundities.  The  two  cases  I  have  selected  are  :  (i.)  "  Sisters,"  daughters  of  the  same 
sire,  but  in  general  not  of  the  same  mare  ;  and  then  (ii.)  "Nieces"  and  "Aunts,"  or 
daughters  of  a  sire  and  the  daughters  of  his  sire.  As  we  have  only  760  sires  and 
nearly  5000  mares,  the  daughters  or  aunts  fall  into  rather  large  arrays,  and  we  are 
compelled  to  use  the  methods  discussed  in  Proposition  IV.,  A  and  B.  Even  so  the 
arithmetical  work  for  a  correlation  based  on  the  index  of  sires  was  far  more  laborious 
than  for  one  based  on  the  index  of  mares. 

(i.)  To  find  the  Correlation  hettveen  Half-Sisters,  Daughters  of  the  same  Sire. 

Here  we  have  to  use  formulse  (xxiii,),  (xxv.),  and  (xxvi.)  of  pp.  272-273.  In  order 
to  do  this  a  table  was  formed  of  the  mean  fecundity  M  of  the  array  of  sisters  due  to 
each  sire,  and  of  ^n{n  —  1),  the  number  of  pairs  of  sisters  in  each  array.  Then  the 
products  |w  {n  —  1)  M  and  ^n  {n  —  1)M"  were  formed,  and  the  numerator  of  (xxiii.), 
or  a-'i,  calculated  by  adding  up  for  all  the  760  sires.    The  result  gave  : 

a-l  =  -6655167, 

where  the  unit  is  the  fecundity  group  element  of  1/15.  The  number  of  pairs  of 
sisters  dealt  with  was  54,305.  The  denominator  0-5(1  —  p^)  +  crl  is  not  so  easily 
ascertained,  o-o  is  the  standard  deviation  of  all  the  series  of  mares  who  are  sisters 
without  weighting;  o-ov/(l  —  p')  is  the  standard  deviation  of  an  array  of  sisters,  or 
if  the  regression  be  not  linear,  the  mean  of  such  standard  deviations  for  all  arrays, 
or  rather  its  square  is  the  mean  of  the  squares  of  such  standard  deviations  ;  p  is  the 
correlation  between  a  patent  character  in  the  daughter  and  a  purely  latent  character 
in  the  sire,  and  cannot  therefore  be  found  directly. 

In  order  to  get  an  appreciation  of  the  standard  deviation  of  an  array  of  sisters — 
it  being  practically  impossible  to  work  out  these  quantities  for  760  arrays — I  selected 
twenty  sires  having  fairly  large  arrays  of  daughters,  and  reached  the  following 
results  : 
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Table  of  Arrays  of  Mares,  which  are  Half-Sisters. 


to  ire. 

•NT  t 

JN  0.  01  mares. 

iVToQ  m    tciT»T1  1  1  TTT 

iyxtJd/11  Id  uji  1  (j  y  . 

^   1  1     f\T  Q-yi^Q-^r 

76 

y  oy  / 

z  yoy 

52 

10  750 

2  545 

67 

9  zOl 

o  175 

Newminster  

64 

8  875 

2  497 

Parmesan  .  .  

37 

9  708 

3  076 

81 

10  210 

2/70 

53 

9-689 

2  74b 

Lord  CI  if  den  

41 

Q-Q7Q 

79 

9  4d7 

o  003 

Blair  Atliol  

87 

9-057 

2-752 

The  Duke  

35 

9-186 

2-474 

Q-9Q7 

DO 

x.\j  ^DO 

43 

9-267 

2-847 

DV 

Stockwell  

80 

9131 

2-003 

40 

8-222 

2-201 

Y.  Melbourne  

55 

9-064 

2-301 

41 

9-951 

1-651 

Totals  

1117 

191-143 

53-259 

Mean  

55-85 

9-55715 

2-66295 

Ditto  in  actual  units*  . 

•6371 

■1775 

I  next  took  the  mean  and  standard  deviation  of  the  1117  mares  to  obtain  ctq. 
The  mean  fecundity  was  now  found  to  be  9'5685  and  ctq  =  2'7824,  or  in  actual  units 
•6379  and  *1855.  Clearly  only  about  \  per  cent,  difference  is  made  whether  we 
take  the  mean  fecundity  of  the  1117  mares,  or  the  mean  of  the  unweighted  means 
of  the  twenty  arrays.  Knowing  croandcro\/l  —  p'  we  can  now  find  p.  We  have 
almost  at  once 

p  =  -2900. 

This  is  probably  the  first  determination  of  a  coefficient  of  inheritance  between  a 
latent  character  in  one  sex  and  a  patent  character  in  the  other  sex.  "We  see  that  it 
has  almost  exactly  the  value  required  ('S)  by  the  law  of  ancestral  heredity,  or  we 
conclude,  mares  inherit  from  their  sires  a  fecundity  governed  closely  by  the  law  of 
ancestral  heredity. 

If  the  reader  asks  why  is  not  the  intensity  reduced  in  this  case  in  the  same 
manner  that  we  find  it  reduced  in  the  case  of  the  inheritance  from  the  dam,  the 
reply  is  : 

(i.)  In  the  case  of  the  dam  and  mare,  both  quantities  to  be  correlated  are  liable  to 
fictitious  values.    In  the  case  of  sire  and  mare,  we  deal  with  only  one. 

*  A  fecundity  unit  is  taken  to  be  1/15,  for  this  is  the  unit  of  gTOuping;. 
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(ii.)  The  influence  of  fictitious  values  has  been  shown  on  pp.  276-277  to  chiefly 
affect  the  coefficient  of  correlation  and  not  the  standard  deviation. 

Now  the  present  result  is  based  solely  on  the  calculation  of  standard  deviations, 
or  on  the  variability  of  fecundity  as  a  whole  and  in  arrays.  It  is  accordingly  not 
influenced  to  nearly  the  same  extent  by  the  existence  of  fictitious  values.  Could  we 
calculate  the  variability  of  the  arrays  of  daughters  due  to  individual  mares,  we 
should  probably  get  a  better  result  for  inheritance  in  the  female  line.* 

The  above  result  is  so  satisfactory  that  I  have  little  doubt  that  we  have  deter- 
mined a  very  good  value  for  cro\/ 1  —  pi  Substituting  it  we  find  for  the  correlation 
between  half-sisters  : 

•66552 
7-09130  +  -66552 

The  law  of  ancestral  heredity  gives  for  7ia^-sisters  r  =  '2,  and  f  of  this  =  '08. 

Thus  we  see  that  the  collateral  heredity  between  half-sisters,  daughters  of  the 
same  sire,  is  quite  sensible,  and  is  almost  what  we  might  have  predicted  would  be 
the  result,  if  we  supposed  correlation  to  be  weakened,  as  in  the  previous  cases,  to 
■f  of  its  value  by  fictitious  records. 

It  is  worth  while  to  consider  the  amount  of  fictitious  fecundity  suggested  by 
the  reduction  factor  f.  We  have  only  to  suppose  the  n^/N  of  our  p.  277  to  be  f. 
Now  we  may  well  assume  the  chance  of  a  fictitious  fecundity  being  recorded  to  be 
the  same  for  either  one  of  a  pair  of  sisters  ;  hence  we  shall  have  p  ■=  q,  and  therefore, 
from  the  result  on  p.  276,  we  find  {p  —  i-f/p'  =  f.  This  gives  us  {p  —  l)p  =  \/'4, 
and  (ni  -|-  n3)/N  the  fraction  without  fictitious  values  =  {p  —  l)/p  =  •6325.  Thus 
in  order  to  introduce  the  reduction  factor  of  f  by  the  occurrence  of  fictitious  values 
of  the  fecundity,  we  should  have  to  suppose  about  37  per  cent,  of  fictitious  values  to 
occur.  This  is,  of  course,  a  sort  of  average  ;  many  values  will  probably  be  only 
partially  fictitious,  i.e.,  will  to  some  extent  approximate  to  their  real  values. 
Considering  the  very  artificial  character  of  the  thoroughbred  brood-mare,  and  the 
uncertainty  of  her  treatment  by  breeders,  this  does  not  seem  such  an  immense 
percentage  that  it  would  force  us  to  the  conclusion  that  the  law  of  ancestral  heredity 
cannot  be  true  for  the  inheritance  of  fecundity. 

(ii.)  To  Jind  the  Correlation  in  Fecundity  hetiveen  the  Sisters  of  a  Sire  and  his 
Daughters. 

What  we  want  is  really  the  correlation  between  aunts  and  nieces,  but  they 

*  The  standard  deviations  for  the  arrays  of  mares  in  Table  XII.  were  indeed  worked  out  for  tlie 
twelve  cases  of  dams  from  e  to  q.  The  mean  of  these  cases  was  sensibly  the  same  whether  the  simple 
mean,  or  the  mean  weighted  with  the  numbers  in- the  array  was  taken,  and  equalled  2'8091  or  -1823.  This 
ifi  0-0(1  —  r'^y-'\  Bui  by  p.  48,  o-q  =  -1888,  whence  we  deduce  r  '1375,  and  the  regression  equals  -1581. 
Thus  we  have  found  a  substantially  larger  value  for  ?•  than  that  on  p.  304  by  dealing  with  variabilities, 
Hud  not  direct  correlations.  Tins  gives  additional  evidence,  if  any  were  needed,  of  the  inheritance  of 
fecundity. 
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are  not  "  aunts  "  and  "  nieces "  in  the  human  sense,  for  the  aunts  are  only  half- 
sisters  of  the  sire.  By  a  process  similar  to  that  on  pp.  408  and  409  of  my  paper 
on  the  "  Law  of  Ancestral  Heredity," I  deduce  that  the  correlation  between  a  sire's 
sisters  and  daughters  ought  to  be  -05,  and  not  '15  as  in  the  case  of  Man.  If  this  be 
weakened  down  to  the  f  of  previous  results,  we  should  not  expect  a  result  differing 
much  from  '02.  As  the  variability  of  the  elder  generation  is  always  less  than  that 
of  the  younger,  we  ought  to  expect  a  coefficient  of  regression  of  about  this  value. 
The  theory  used  will  be  that  of  p.  273  of  the  theoretical  part  of  this  paper.  The 
weighted  mean  fecundity  found  for  the  arrays  of  aunts  and  nieces  was  as  follov's  : — 


Without  grouping. 

With  grouping. 

Arrays  of  nieces  .... 

•6195 
•6346 

•6199 
•6338 

The  grouping  was  done  in  fecundity  units  of  |-,  i.e.,  1/30  change  in  fecundity.  The 
agreement  may  accordingly  be  considered  very  good.  The  "  aunts  "  are  the  daughters 
of  the  older  sires,  who  owing  to  in-and-in  breeding  form  a  comparatively  small  group, 
'  and  are  the  sires  of  mares  belonging  to  the  older  period,  whose  fecundity  is  much 
weakened  by  causes  already  referred  to.  Their  mean  fecundity  is  slightly  less  than 
that  of  granddams,  given  on  p.  305,  while  the  mean  fecundity  of  their  nieces  agrees 
well  with  that  for  the  corresponding  group  of  mares. 

The  method  of  grouping  being  adopted,  a  correlation  table  was  formed  for  the 
mean  fecundities  of  arrays  of  mares,  daughters  of  a  sire,  and  of  arrays  of  mares, 
daughters  of  his  sire.  This  is  Table  XVI.  Here  each  mean  is  weighted  with  the 
number  of  pairs  of  aunts  and  nieces  in  the  two  arrays,  i.e.,  the  extent  of  the  data  on 
which  it  is  based.  It  represents  accordingly  138,424  pairs  of  aunts  and  nieces. 
The  following  results  were  obtained,  corresponding  to  687  pairs  of  sires  : — 

Sire's  Sire.  Sire. 
^^^^  M„  =  -6199.  M„,  =  -6338. 

^„  =  -04344.  =  -07609. 

R=-1174. 

It  will  be  at  once  noticed  how  much  more  variable  are  the  array-means  for  the 
sire  than  for  the  sire's  sire.  The  means  of  many  of  the  sire's  arrays  are  based  upon 
small  numbers,  which  would  have  been  selected  oiit,  if  we  had  gone  to  another 
generation  as  in  the  case  of  the  sire's  sire. 

It  will  clearly  not  be  legitimate  in  this  case  to  put  a\  =  cr^  as  suggested  on  p.  274. 
There  is  probably  no  secular  change  of  importance  here,  but  the  sire's  sire  requiring 


*  'Roj.  See.  Proc.,'  vol.  62. 
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three  generations  from  the  record  is  really  more  stringently  selected  than  the  sire 
with  only  two.  We  can  now  form  a  and  cr'  by  (xxviii.)  and  (xxix.),  if  we  adopt 
suitable  values  of  cto  and  cr'o,  p,  as  we  have  seen,  may  with  high  probability  be  put 
equal  to  -3  (p.  309).  ctq  for  groups  of  daughters,  on  p.  309,  is  given  as  "1855,  but  since 
this  certainly  included  a  fair  number  of  what  are  now  aunts,  it  must  be  somewhat  too 
low  for  cr'o.  We  can  well  put  cr'o  equal  to  the  '1888  of  the  mares  on  p.  304.  o-q  for  aunts 
cannot  be  as  low  as  the  standard-deviation  of  dams  on  that  page,  as  many  of  the 
aunts  may  never  appear  in  the  record  as  granddams,*  i.e.,  they  are  less  stringently 
selected.    The  mean  of  the  two  results  for  mares  and  dams  may,  perhaps,  be  taken 

as  a  close  enough  approximation  for  our  present  purpose,  or  o-q  =  '1765.  We  then 
deduce 

a  —  -1739,        a'  —  '1955. 

If  we  compare  the  results  now  found  with  those  for  sisters  cited  on  pp.  308  and  309, 
we  find  : — 


"Aunts." 

"  Sisters." 

"  Nieces." 

M 

■6199 

•6371 

•6338 

•0434 

•0544 

•0761 

a 

•1739 

•1855 

•1955 

The  accordances  and  divergences  are  much  what  we  might  expect,  except  in  the 
case  of  cr„.  We  should,  d  priori,  have  expected  "  sisters "  to  have  approached 
nieces  more  nearly  than  aunts.  The  work  has  been  gone  carefully  through,  but  I 
have  not  succeeded  in  finding  any  error.  In  the  "  nieces,"  of  course,  the  weighting 
of  an  outlying  fecundity-mean  due  to  a  sire  with  but  few  daughters,  may  still  be 
large,  if  his  sire  have  numerous  daughters ;  this  cannot  occur  in  the  case  of  "sisters," 
as  the  weighting  depends  only  on  the  number  in  the  array.  The  like  heavy 
weighting  cannot  usually  occur  in  the  case  of  "  aunts,"  for  they  are,  as  a  rule  (owing 
to  selection  to  the  third  generation)  daughters  of  old  and  famous  sires,  with  plenty 
of  material  for  basing  averages  upon.  We  do  not  get  many  ''nieces"  attached  to 
"  aunts,"  who  are  not  daughters  of  famous  sires.  Such  is  probably  the  source  of 
divergence  in  cr„  between  nieces  and  sisters. 

Using  formula  (xxviii.),  on  p.  274,  we  find 

?''=-0114, 

and  for  the  regression  coefficient  •0128. 

This  value  is  much  below  the  '05  of  the  law  of  ancestral  heredity,  and  below  the 
reduced  value  "02,  which  we  might  have  expected  to  reach.    Still,  it  again  shows 

*  Every  dam  appears  as  a  granddam,  otherwise  the  fecundity  of  the  daughter  could  not  have  been 
found. 
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positive  correlation,  and  we  may  conclude  that  the  patent  character  in  the  daughter 
is  inherited  latently  through  the  male  line.* 

But  there  is  another  and  far  more  significant  method  of  looking  at  this  result, 
namely,  by  considering  the  meaning  of  E  on  p.  274.  We  may  treat  the  fecundity  of 
daughters  as  really  a  character  of  the  sire,  and  their  mean  fecundity  as  a  measure  of 
a  latent  character  in  him.  R  is  then  the  correlation  between  a  latent  character  in 
both  a  stallion  and  his  sire,  and  we  see  that  it  is  sensibly  inherited  for  R  =  '1174. 
To  compare  with  the  law  of  ancestral  heredity,  we  must  use  the  coefficient  of  regres- 
sion, for  the  stallions  are  much  more  variable  than  their  sires.    We  find 

Regression  of  stallion  on  sire  =  '2056, 
which  carries  us  a  long  way  in  the  direction  indicated  by  that  law.  Thus  it  is 
extremely  probable  that  this  law  of  inheritance  applies  not  only  to  the  inheritance  of  a 
patent  character,  or  of  a  character  latent  in  one  sex  and  transmitted  to  a  second,  but 
also  to  the  inheritance  of  a  character  latent  both  in  the  transmitter  and  receiver.  The 
present  method  accordingly  seems  applicable  to  the  inheritance  of  a  character  latent  in 
two  individuals,  if  we  take  the  mean  of  the  character,  when  patent  in  the  offspring,  as  a 
measure  of  its  strength  in  the  individual  in  whom  it  is  latent.  If  li  be  the  measure 
of  a  latent  character  in  a  parent,  then  the  offspring  will  have  a  mean  value  +  Cj 
of  this  character,  where  q  is  the  coefficient  of  parental  regression  and  Cj  a  constant. 
If  U  be  the  measure  of  the  same  latent  character  in  a  relative,  then  the  offspring  in 
this  case  will  have  ql^  +  Cn  of  the  character.  But  the  correlation  of  and  L  will  be 
identical  with  that  of  ql^  +  Cj  and  qL  +  c.,  as  I  have  shown  elsewhere.t  Thus  the 
mean  of  the  patent  character  in  the  offspring  may  be  used  to  measure  the  correlation 
between  latent  characters  in  their  parents. 

To  sum  up  our  results  for  thoroughbred  mares,  we  conclude  that  their  fecundity, 
notwithstanding  the  imperfections  and  difficulties  of  the  record,  has  been  demon- 
strated to  be  inherited,  and  this,  both  through  the  male  and  female  line,  so  far  as 
we  can  judge,  with  an  equal  intensity.  The  apparent  value  of  this  intensity,  except 
in  the  case  of  latent  characters,  is  much  below  that  required  by  the  law  of  ancestral 
heredity,  roughly,  perhaps,  2/5  of  that  value ;  but  there  is  considerable  reason 
to  think  that  this  reduction  may  take  place  owing  to  the  presence  of  fictitious  values 
in  the  record  arising  from  the  peculiar  circumstances  under  which  thoroughbred 
horses  are  reared  and  bred.  These  fictitious  values  would  hardly  influence  the 
means  and  variability  of  arrays  like  they  must  do  the  relationship  between  pairs  of 
individuals.  Hence,  when  we  deal  with  such  means  and  variabilities  as  in  the  cases 
on  pp.  309  and  313,  we  find  a  much  closer  approach  to  the  law  of  ancestral  heredity. 
Fecundity  is  certainly  inherited  ;  that  it  is  inherited  according  to  the  Galtonian  law 

*  As  a  matter  of  fact,  this  conclusion  is  stronger  than  it  appears  here,  for  the  correlation  between 
nieces  and  aunts  was  worked  out,  without  grouping,  for  fourteen  distinct  series,  and  in  thirteen  of  them 
was  found  to  be  sensibly  positive ;  in  the  fourteenth  it  was  found  to  have  an  insignificant  negative 
value. 

t  "  On  the  Reconstruction  of  the  Stature  of  Prehistoric  Races,"  '  Phil.  Traus.,'  A,  vol.  192,  p.  183. 
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is  not  demonstrated,  hut  may  be  treated  as  probable  until  the  results  of  further 

mvestigations — preferably  by  breeding  experiments  instituted  for  this  very  purpose  

are  available. 

(18.)  Co7iclusion. —The  investigations  of  this  memoir  have  been  to  some  extent 
obscure  and  difficult,  but  the  general  result  is  beyond  question. 

FertiliUj  and  fecundity,  as  shown  hy  investigations  on  mankind  and  on  the 
thoroughbred  horse,  are  inherited  characters. 

The  laws  of  inheritance  of  these  characters  are  with  considerable  probability  those 
already  developed  in  my  memoir  on  the  Law  of  Ancestral  Heredity  for  the  inherit- 
ance of  directly  measurable  organic  characters. 

In  the  course  of  the  work  it  has  been  shown  how  a  numerical  measure  may  be 
obtained  for  the  inheritance  of  a  character  by  one  sex  from  the  otiier,  when  it  is 
patent  in  the  former  and  latent  in  the  latter.  Fertility  and  fecundity  purely  latent 
in  the  male  (in  the  sense  here  used)  are  shown  to  be  transferred  by  him  from  his 
mother  to  his  daughter.  Thus  Darwin's  views  with  regard  to  the  transmission 
through  one  sex  of  a  character  peculiar  to  the  other  are  given  a  quantitative 
corroboration. 

When  we  turn  from  these  points  to  their  weight  and  importance  for  the  theory  of 
evolution,  we  are  at  once  encountered  by  all  the  wide-reaching  principles  which  flow 
from  the  demonstration  that  genetic  (reproductive)  selection  is  a  true  factor  of 
development.    Let  us  look  at  these  a  little  more  closely. 

If  natural  selection  were  to  be  absolutely  suspended,  i.e.,  if  there  were  no 
differential  death-rate  at  all,  then  development  would  not  for  a  moment  cease.  Not 
only  is  fertility  inherited,  but  there  can  be  small  doubt  that  it  is  closely  correlated 
with  all  sorts  of  organic  characters  ;  thus  the  inheritance  of  fertility  marks,  the 
moment  natural  selection  is  suspended,  a  progressive  change  in  a  great  variety  of 
organic  characters.  Without  a  diflerential  death-rate  the  most  fertile  will  form  in 
every  generation  a  larger  and  larger  percentage  of  the  whole  population.  There  are 
very  few  characters  which  may  not  be  supposed  to  be  more  or  less  directly  correlated 
with  fertility,  and  in  reproductive  selection  we  see  a  cause  of  progressive  change 
continuously  at  work.t  There  is,  so  to  speak,  in  every  species  an  innate  tendency 
to  progressive  change,  quantitatively  measurable  by  determining  the  correlation 
coefficients  between  fertility  and  organic  characters,  and  between  fertility  in  the 
parents  and  in  the  offspring.  This  "innate  tendency"  is  no  mysterious  "force" 
causing  evolution  to  take  place  in  a  pre-ordained  direction  ;  it  is  simply  a  part  of  the 
physical  organisation  of  the  individual,  which  does  not  leave  fertility  independent  of 

*  The  method  is  perfectly  geueral,  and  a  value  can  always  be  found  for  the  intensity  of  transmission 
of  a  sexual  character  through  the  opposite  sex.  We  could  obtain,  for  example,  a  numerical  measure  of 
the  manner  in  which  a  bull  transmits  good  milking  qualities  to  its  offspring. 

t  I  have  endeavoured  to  show  ('  Roy.  Soc.  Proc.,'  vol.  59,  p.  303),  that  fertility  is  correlated  with 
stature  in  woman.    I  hope  later  to  return  to  the  correlation  of  fertility  and  physique. 
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physique  and  organic  relationship,  or  leave  these  characters  uncontrolled  by  the 
principle  of  heredity.  It  seems  to  me,  therefore,  that  the  results  of  this  memoir 
force  on  us  some  modification  of  current  views  of  evolution.  The  suspension  of  natural 
selection  does  not  denote  either  the  regression  of  a  race  to  past  types,  as  the 
supporters  of  panmixia  suggest,  or  the  permanence  of  the  existing  type,  as  others 
have  believed.  It  really  denotes  full  play  to  genetic  or  reproductive  selection,  which 
will  progressively  develop  the  race  in  a  manner  which  can  be  quantitatively  predicted 
when  once  we  know  the  numerical  constants  which  define  the  characters  of  a  race 
and  their  relation  to  racial  fertility.  In  other  words,  natural  selection  must  not  be 
looked  upon  as  moulding  an  otherwise  permanent  or  stable  type  ;  it  is  occupied  with 
checking,  guiding,  and  otherwise  controlling  a  progressive  tendency  to  change. 

So  soon  as  a  species  is  placed  under  a  novel  environment,  either  artificially  or 
naturally,  the  equilibrium  is  disturbed,  and  it  will  begin  to  progress  in  the  manner 
indicated  by  genetic  (reproductive)  selection,  until  this  progress  is  checked  by  the 
development  of  characters  in  a  manner  or  to  an  extent  which  is  inconsistent  with 
fitness  to  survive  in  the  new  surroundings.  Within  a  very  few  generations  a  novel 
environment,  sympathetic  so  to  speak  to  the  progressive  tendency  indicated  by 
reproductive  selection,  produces  the  suitable  variations  without  the  assistance  of 
natural  selection.  It  seems  to  me  that  this  principle  ought  to  be  borne  in  mind 
when,  in  laboratory  experiments  or  in  artificial  breeding,  natural  selection  is  wholly 
or  largely  suspended,  or  again  is  altered  in  type ;  the  species  dealt  with  is  unlikely 
to  remain  constant  for  several  generations,  but  will  develop  in  the  direction  indicated 
by  genetic  selection.  Further,  when  stable  types  of  life  like  the  English  sparrow  are 
taken  to  America,  or  the  English  rabbit  to  Australia,  where  initially  they  fill  a  more  or 
less  vacant  field  among  living  forms,  and  natural  selection  is  in  part  suspended,  we 
should  expect  in  a  few  generations  a  considerable  divergence  in  type.*  The  converse 
aspect  of  the  problem  is  also  of  great  importance ;  namely,  the  natural  selection 
of  physical  characters  must  tend  to  indirectly  modify  fertility  and  fecundity,  if 
these  be  correlated  with  those  characters.  Variations  in  the  fertility  of  local  races 
need  not  be  looked  upon  as  due  directly  to  environment,  but  may  arise  from  the 
selection  of  characters  correlated  with  fertility,  combined  with  the  law  that  fertility 
is  itself  an  inherited  character. 

Lastly,  the  inheritance  of  fertility  involves  the  "acceleration"  of  fertility;  a 
race,  natural  selection  being  suspended,  tends  not  only  to  increase  but  to  increase 
at  an  increasing  rate.  This  principle  is  again  full  of  meaning,  not  only  for  the  study 
of  the  manner  in  which  lower  types  of  life  rapidly  expand  under  changed  environ- 
ment, but  also  for  the  problems  set  to  those  philosophers  who  may  desire  that  the 
most  social  and  not  the  most  fertile  type  of  citizen  may  predominate  in  our  modern 
civilised  communities,  where  the  state  and  public  opinion  to  a  greater  or  less  extent 
hinder  natural  selection  from  playing  the  great  part  it  does  in  wild  life. 

*  It  would  be  interesting  to  know  whether  the  size  or  frequency  of  the  litter  ot  the  Australian 
rabbit  is  greater  than  that  of  the  English. 
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NOTE. 

In  August,  1899,  I  presented  a  memoir  to  the  Royal  Society  on  the  inheritance  of  coat-colour  in  the 
horse  and  of  eye-colour  in  man,  which  was  read  November,  1899,  and  ultimately  ordered  to  be  published  in 
the  '  Phil.  Trans.'  Before  that  memoir  was  printed,  Mr.  Yui.e's  valuable  memoir  on  Association  was  read, 
and,  further,  Mr.  Leslie  Bramley-Moore  showed  me  that  the  theory  of  my  memoir  as  given  in  §  6  of  the 
present  memoir  led  to  somewhat  divergent  results  according  to  the  methods  of  proportioning  adopted. 
We  therefore  undertook  a  new  investigation  of  the  theory  of  the  whole  subject,  which  is  embodied  in  the 
present  memoir.  The  data  involved  in  the  paper  on  coat-colour  in  horses  and  eye-colour  in  man  have  all 
been  recalculated,  and  that  paper  is  nearly  ready  for  presentation.*  But  it  seemed  best  to  separate  the 
purely  theoretical  considerations  from  their  application  to  special  cases  of  inheritance,  and  accordingly  the 
old  memoir  now  reappears  in  two  sections.  The  theory  discussed  in  this  paper  was,  further,  the  basis  of  a 
paper  on  the  Law  of  Reversion  with  special  reference  to  the  Inheritance  of  Coat-colour  in  Basset  Hounds 
recently  communicated  to  the  Society,  and  about  to  appear  in  the  '  Proceedings.'! 

While  I  am  responsible  for  the  general  outlines  of  the  present  paper,  the  rough  draft  of  it  was 
taken  up  and  carried  on  in  leisure  moments  by  Mr.  Leslie  Bramley-Moore,  Mr.  L.  N.  Gr.  Filon,  M.A., 
and  Miss  Alice  Lee,  D.Sc.  Mr.  Bramley-Moore  discovered  the  ^-functions ;  Mr.  Filon  proved  most  of 
their  general  properties  and  the  convergency  of  the  series ;  I  alone  am  responsible  for  sections  4,  5,  and  6. 
Mr.  Leslie  Bramley-Moore  sent  me,  without  proof,  on  the  eve  of  his  departure  for  the  Cape,  the 
general  expansion  for  z  on  p.  26.  I  am  responsible  for  the  present  proof  and  its  applications.  To  Dr. 
Alice  Lee  we  owe  most  of  the  illustrations  and  the  table  on  p.  17.  Thus  the  work  is  essentially  a 
joint  memoir  in  which  we  have  equal  part,  and  the  use  of  the  first  personal  pronoun  is  due  to  the  fact 
that  the  material  had  to  be  put  together  and  thrown  into  form  by  one  of  our  number. — K.  P. 
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§  (1.)  On  a  General  Theorem  in  Normal  Correlation. 
Let  the  frequency  surface 


1     /  Jf^.y^  _  2ray\ 


where 


N  =  total  number  of  observations, 
cr^,  cTo  =  standard  deviations  of  organs  x  and  y, 
r  =  correlation  of  x  and  y, 

be  divided  into  four  parts  by  two  planes  at  right  angles  to  the  axes  of  x  and  y  at 
distances  li  and  k'  from  the  origin.  The  total  volumes  or  frequencies  in  these  parts 
will  be  represented  by  «,  h,  c,  and  d  in  the  manner  indicated  in  the  accompanymg 
plan : — 


Tdble  of  Frequencies 


d 

b 

d  +  b 

C 

d 

c+d 

d+C 

N 

Then  clearly 


27rv/(l 
N 


==;   e  ^'ir^ATi^-^Ti^-'^J  dxdy 


e-^Y^S-'^y'-''''y'^dxdy 


if 


27r^(l  -  r^) 

li  —  Ji  /(Ti    and    h  =  ^7*^2 
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Further, 


and 


h  +  d 


_  ^  r 


o-i' 


hi 


c  +  d  =  ~\  e-'^y'dy   (lii.), 

^a  +  c)-(l  +  cl)_   (iv.). 

VI  t^'"'^  (-)• 


{a  +  h)  -  (c  +  cl) 


Thus,  when  a,  6,  c,  and  are  known,  h  and  ^  can  be  found  by  the  ordinary  table  of 
the  probabihty  integral,  say  that  of  Mr.  Sheppard  ('  Phil.  Trans.,'  A,  vol.  192.  p.  167, 
Table  VI.*).    The  limits  accordingly  of  the  integral  for  d  in  (i.)  are  known. 

Now  consider  the  expression 


x/1 


=  (3-^r^.<'^^-^^-^-^>  =  U,  say,  (vi.), 


and  let  us  expand  it  in  powers  of  r.    Then,  if  the  expansion  be 


(vii.), 


we  shall  have 


Taking  logarithmic  differentials,  we  get  at  once 


(1  -  ^')'  ^  =  {^y  +  r  (1  -      -  y^)  4-  T^xy  -  r^}JJ. 

Differentiating  n  times  by  Leibnitz's  theorem,  and  putting  r  =  0,  we  have,  after 
some  reductions 

=  n{2n  —  I  —  x^  —  y'^)u^_^ 
-  n{n  -  1)  (n  -  2fu„_^ 

■\-xy{u„  + n{n—l)u„_^}   .  (ix.). 


Hence  we  find 


Uo  = 


Uo  = 


1 

xy 

{x^-l){y^-l) 

(x*  -  6x^  +  3)  {y^  -  6y^  +  3) 

*  See,  however,  foot-note,  p.  5. 
B  2 


.  (X.) 
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Thus  the  following  laws  are  indicated  : — 

M;,  =  V,,  X  tv,,  .    .    .   ,    .    .  (xi.), 

where  v,,  =  xv,,_^  —  {n  —  l)v„_2  (xii.), 

=  ywn-i  —  {n  —   (xiii.). 

We  shall  now  show  that  these  laws  hold  good  by  induction.    Assume  7o^.y-<yu-e. 

Oa.  -Thus  Un+^  =  xyu,,  +  nH„_^  —  n{yWnV„-i  +  xv„w„_t^). 

-But  by  (ix.),  substituting  for  w.^.g  from  (xi.)/and  (xiii.),  /uoi!Zt^^  ^^--^  >fc^ 

'^"+1  =  xy  {v„io,,  +  n  {n  —  l)'y«_2?t/-;,_2j  +  n{2n  —  I  —  x^  —  y'^)v,,_-^iv„_^ 
—  n{n  —  l)v,,_^iv„_^  —  xyn{n  —  l)v„_3W^„_3 
-\-n{n  —  I)  {yv,_^itJ,,_2  +  xv,_2iv,,_-i). 

=  xyv,,iv„  +  irv,_^w,,_^  ~  n{x^  +  y^)v,,_^iv„_^ 
+  n{n  —  1)  {yv,,_^w,,_o^  +  xv„_2W„_-^) 

'  ?T  -  /    

=  xyVuWnAr  n^v,,_^Wn_^  — n{yVn_^w,,-\- xw,,_^vi)     try  x/j  co^^  x.///', 
—  '^n+\'^n+i>  as  we  have  seen  above. 
Thus,  if  the  theorem  holds  for  u^,  it  holds  for  Un^y  Accordingly 

U  =  e-^^-^  {l  +  "fiV  +  +  .  .  .  +  +...),..  (xiv.), 

where  the  v'&  and  vjb  are  given  by  (x.),  (xii.),  and  (xiii.). 

1  .  .  . 

It  is  thus  clear  that       ^    j        dx  dy  consists  of  a  series  of  which  the  general 


Ih     J  k 

term  is 

1 


n 


where  V„  =  -73=     e  ^'''v^.dx 


It  remains  to  find  these  integrals. 
The  general  form  of  v^i,  is  given  by 
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For  this  obviously  gives  (x.).    Assume  it  true  for        and  i',,._o,  then 

_ („ _  1)...,  =     -  +  0-l)0.-2)(^-3)(,.-4)^.,.,_ 

_  („  _  1),,,-.  +  (^-l)(^^-2)(»-3)^.._,  _ 

~"  2|1  ^  22  )2  —  ... 

Thus  the  expression  (xv.)  is  shown  to  hold  by  induction,  the  general  terms  being 

/     1  yi^'  -  1)  0^  -  2)  ...      -  2r  +  1)  fn  -  2r        \  ^^.^^ 
(-1)   2'-'  Ir-  1  I     2r     "T  M 


_  7^^-1)0.-2)  .  ■  ■  (n-2r  +  1) 


or  the  general  term  in  Un. 
We  notice  at  once  that 

^  =  ^^''-^  (^^^^•)- 

Thus,  by  (xii.) 

_  _  dv„_-^ 

Multiply  by  e"*""'  and  integrate 

Integrating  the  latter  integral  by  parts,  we  have 

V„  =  ^      V,fi-^^'dx  = 

Now  e~*'''  can  be  found  from  any  table  of  the  ordinates  of  the  normal  curve, 

e.g.,  Mr.  Sheppaed's,  '  Phil.  Trans.,'  A,  vol.  192,  p.  153,  Table  L*  We  shall  accord- 
ingly put 

H  =  -^e-i'\  ■  K  =  ~  e-i''   (xvii.), 

V  27r  V  27r  ^ 

and  look  upon  H  and  K  as  known  quantities. 

*  For  our  present  purposes  the  differences  of  Mr.  Sheppard's  tables  are  occasionally  too  large,  but  the 
following  series  give  very  close  results  : — 

Let  =  Vl  =  j  V-^^^  dx  by  (iv.). 

X,=.^l(jL±J^^±A  =  ^y.^dyhyiy.). 
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Further,  let  us  write  {v„_-^),,  ^  i  as  v„.  3,  and  similarly  =  k  as  Thus 

V„  =  H .     J,  W„  =  K .   (xviii.). 

We  have  then  from  (i.) 

^  f    f  \Jdxdy 


N  27r, 


(b  +  d)(c  +  -fr-        -  - 


'  7\|» 
by  (ii.)  and  (iii.). 

Or,  remembering  that  N  =  a  +  5  -f-  c  +  c/,  we  can  write  this 
ad  —  he      "  /  r»  — 


N^HK  ~  "-^ 

=  r+pik  +  '^{h'  -  1)(F  -  1)  +  ^h{h'^  -  S)k{k^  -3) 

+  -      +  ^)      -      +  ^) 

H- -  10/^2  _j_       ^  (^4  _         +  15) 

+  ^(^®  -  15^^  +         -  15)      -  15;^4  _j_  45;^3  _  15) 

+  ^^^(/^^  —  21^*  +  105/12  _  105)^(^6  _  2iy^4  _|_  105^  -  105)  +  ,  &c. 

....  (xix.). 

Then  h  =  xi  +  p- Xi^  +  |xi^  +  ^Xi'  +  -  ■  ■ 

1         /^A       1     2.7     ,  ,127    ,  , 

and  1     3      7     ,     127  . 

^  =  X2  +  13  X2'  +      X2'  +  -p^  X2^  +  •  •  ■ 

1         j^f,       1  v2      7     ,      127    ,  \ 
K  =  '^^'^(l  +  12  14       +  Ti  ■  •  ■)■• 

These  follow  from  the  considerations  that  if 


Xi  = 

X2  = 

x/2'r</)2, 

dh 

H, 

dk  ~ 

K, 

dH.  _ 

-  h, 

dK  _ 

-  k, 

#1  ~ 

#2  ~ 

whence  it  is  easy  to  find  the  successive  differentials  of  h  with  regard  to  and  k  with  regard  to  (^o,  and 
then  obtain  the  above  results  by  Maclaurin's  theorem.  There  is,  of  course,  no  difficulty  in  calculating 
H  and  K  from  (xvii.)  directly.    That  method  was  adopted  in  the  numerical  illustrations. 
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Here  the  left-hand  side  is  known,  and  since  h  and  k  are  kno-wn,  we  can  find  the 
coefficients  of  any  number  of  powers  of  r  so  soon  as  the  first  two  have  been  found, 
from  (xii.)  and  (xiii.). 

Accordingly  the  correlation  can  be  found  if  we  have  only  made  a  grouping  of  our 
frequencies  into  the  four  divisions,  a,  b,  c,  and  d. 

If  h  and  k  be  zero,  we  have  from  (xvii.)  and  (iv.) 

H  =  K  =  ^ 


x/2'jr 

The  right-hand  side  of  (xix.)  is  now 


r  +  i^r^  + 


or  equal  to  sin  ^  r. 

-r-r  •      ^     (^^  —  he) 

Hence  r  =  sm  27r  —  

=  cos  77   (XX.  ), 

which  agrees  with  a  result  of  Mr.  Sheppard's,  '  Phil.  Trans.,'  A,  vol.  192,  p.  141.  We 
have  accordingly  reached  a  generalised  form  of  his  result  for  any  class-index  whatever. 
Clearly,  also,  r  being  known,  we  can  at  once  calculate  the  frequency  of  pairs  of  organs 
with  deviations  as  great  as  or  greater  than  h  and  k. 

§  (2.)  Othe7'  Series  for  the  Determination  of  r. 

For  many  purposes  the  series  (xix.)  is  sufficiently  convergent  to  give  r  for  given 
h  and  k  with  but  few  approximations,  but  we  will  now  turn  to  other  developments. 
We  have  by  (vii.) 

\jJdr  =  + 1-  +  ...+„„  ^  +.  . 


Put  X  =  h,  y  =  k,  and  write  for  brevity 

ad  —  be 
^  ~  N^HK 

It  follows  at  once  from  (xix.)  that 

Jo 

_  p^/tHA")  f '   1  (A2  +  i.2  _  2rhL) 


(xxi.). 
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! 

rB  }  (xxii.), 

__  gi'i"  j  tan  e  -  /t  sec  e)^  J 

if  r  =  sin  ^, 

Now  either  of  the  quantities  under  the  sign  of  integration  in  (xxii.)  can  be  expanded 
in  powers  of  Q  by  Maclaukin's  theorem.    Thus  let 

^  _  g— tan  e  -  7i  sec 

=-+Ct)/+Ct)|H-....+(S)j;H-... 

Then 

and  it  remains  to  find  {'~\  . 

Now  log  X  =  —  i     tan  6*  —  /i  sec 

Hence 

cos3  6*  ^  =  -  X  [(Z^'"  +      sin  ^  -  AZ;  (f  -  i  cos  2^)]. 


DifiPerentiating     —  1  times  by  Leibnitz's  theorem,  and  putting  6  =  0, 

+  ("-l)-;_(';-'-  +  L)[eos  f  {'i^(3  +  3')  -  4(A=+i')} 
+  sinf2'MlM  +  ...  =  0  (xxiii.). 


J  V^"^  /O 

Clearly      =         then  we  rapidly  find 

(^)^  =  M  (A^F  -  3(7.2  +  F)  +  5}. 
Or,  finally 

e=e+  _  (;^2  +  F  _  7^2^)  1'  +  /i/f  [A^F  -  3(7.2  +  F)  +  5}      +  ^  _  ^  (xxiv.), 

where  more  terms  if  required  can  be  found  by  (xxiii.).  If  6  be  fairly  small,  d''  will  be 
negligible.    Or  if  h  and  k  be  small,  the  lowest  term  in  the  next  factor  will  be     -f  P, 
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and  this  into  O^j  |5  is  generally  quite  insensible.  Very  often  two  or  three  terms  on  the 
right-hand  side  of  (xxiv.)  give  quite  close  enough  values  of  0,  and  accordingly  of 
r  =  sin  6.  (xxiv.)  is  clearly  somewhat  more  convergent  than  (xix.)  if  h  and  k  are,  as 
usually  happens,  less  than  unity. 

Returning  now  to  (xix.),  let  us  write  it 

e=f{r,h,k). 

This  is  the  equation  that  must  be  solved  for  r.  Suppose  a  root  of  this  when  we 
retain  only  few  terms  on  the  right,  say  a  root  of  the  quadratic 

e  =  r  +  \hkr^. 

Then  if  r  =  r^^  -\-  p, 

e  =  f{r„  h,  h)  -f  pf'{r,,  h,  k)  +  ^pTK  h  k)  +  &c. 

Hence  p  =  ^  to  a  third  approximation 


=  -~^l      ::r;iI^-T.;^  nearly  (xxv.), 


l-ro 


which  gives  us  a  value  of  p  which,  substituted  in  p^  in  the  above  equation,  introduces 
only  terms  of  the  6th  order  in  r,^. 

Another  integral  expression  for  e  of  Equation  (xxi.)  may  here  be  noticed  : 

Jo\/l  — r- 

Hence 

Jo  a/1  —r^ 
Jo  \/l  —  r^- 


^— .  or.  r  =  oor9  rh 


1  —  1 

Let  tan  26  =  ,  or,  r  =  cos2  6. 

'       1  +  r  ~ 

Therefore 


J  1  1  +  ■»2 


where  w  =  cot  (/»  and  is  >  1. 

It  seems  possible  that  interesting  developments  for  e  might  be  deduced  from  this 
integral  expression. 

VOL.  cxcv.  — A.  o 


10 


PROFESSOR  K.  PEARSON  ON  MATHEMATICAL  CONTRIBUTIONS 


§  (3.)  To  shoiv  that  the  Series  for  r  is  Convergent  if  r  <  1,  ivhatever  he  the  Values 

of  h  and  k. 

Write  the  series  in  the  form  of  p.  6,  i.e.  : — 


Now 

_Vn+i  =  hv„  —  nv^_^ 
w,,^i  —  hv„  —  nw„_^ 

From  these  we  deduce 


by  (xii.)  and  (xiii.). 


=  {h^  -  {2n  -  1)}  v,_,  -  {n  ~l){n-  2)  v,,_, 
Wu+i  =  {P  —  (2n  —  1)]  w,_^  —  (n  —  1)  (n  —  2)~iv,,_^ 

Now  let  s„  =  v^_-^r"'l {\7i}\  t„  =rwa-iT^"^  \\n]K 

Then  we  find 


v/(«+  1)0^  +  2)  V  n(n  +  l){n  +  2)  '"-^"^  ' 

,  _  /^^-  -  (2n  -  1)  ^  _  /  {n-  l){n  -  2)  ^ 
'"^2  ~  ^/0^  +  l)(7.+  2)  V  n(n  +  l)in  +  2)  ''"'^'^  ' 

Thus,  when  n  is  large,  we  find  the  ratio  of  successive  terms  s^^^jsn  or  tj^^Jt,,  is  given 
by  p,  where 

p  =  —  2r  —        or,  p  =  —  r. 

The  ultimate  ratio  of  s„^ci  t,,^^  to  s„  t,^  is  accordingly  given  by  r",  but  this  is  the 
ratio  of  alternate  terms  of  the  original  series.  The  original  series  thus  breaks  up 
into  two  series,  one  of  odd  and  one  of  even  powers  of  r.  Both  these  series  are 
absolutely  convergent  whatever  h  and  k  be,  having  an  ultimate  convergence  ratio  of  r  ^ 


§  (4.)  To  find  the  Prohahle  Error  of  the  Correlation  Coefficient  as  Determined  hy  the 

Method  of  this  Memoir. 

Given  a  division  of  the  total  frequency  N  into  a,  6,  c,  d  groups,  where 
a-f6  +  c  +  f^  =  N,  then  the  probable  error  of  any  one  of  them,  say  a,  is  '67449  a,,, 
where* 

=   (-vi.). 

Let  h  -\-  d  =■  n^,  c  -\-  d  =  n^,  then 

*  The  standard  deviation  of  an  event  which  happens  np  times  and  fails  nq  times  in  n  trials  is  well 
known  to  be  Jnpq.  The  probable  errors  here  dealt  with  are  throughout,  of  course,  those  arising  from 
different  samples  of  the  same  general  population 
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To  obtain  r,d  we  have,  if  h-q  denotes  an  error  in  any  quantity  17, 

Sc  -|-  8(i  =  8n2, 
.'.        -\- a-J^  ^- ^a-c(Tarcd  =  (T„^  (xxviii.), 


by  squaring,  summing  for  all  possible  variations  in  c  and  d,  and  dividing  by  the  total 
number  of  variations. 

Hence,  substituting  the  values  of  the  standard  deviations  as  found  above,  we 


deduce 

0-,0-^7',^  =  -  C6//N    .     .     ...     .     .  (xxix.). 

In  a  similar  manner 

(rd<T,rdn,  =  d{a-^c)l^  (xxx.). 

and  (TdO-nrd,,,=  d  {a +  h)ll^  (xxxi.), 


Now  =  -7=-  e~^-''\lx, 

Thus  (Xn,=  NHo-/,  (xxxii.), 

and  similarly  cr„^  =  NK  a-/,  (xxxiii.). 

Hence  the  probable  error  of  h 

^^^wrm^IIi: ,  ...  .  .(_iv.). 

and  of  i  ■  =  ^(ZIW)  .  .(..XV.). 

They  can  be  found  at  once,  therefore,  when  H  and  K  have  been  found  from  an 
ordinate  table  of  the  exponential  curve,  and  a,  h,  c,  d  are  given.  We  have  thus  the 
probable  error  of  the  means  as  found  from  any  double  grouping  of  observations. 

Next,  noting  that 

,        ,  ,    .  Swl  S.n2  =  N2HKS/l8^^  ,    .  • 

we  have  o-„jO-„^r„_„^  =  N^HK  o-/,o-/,?v.z, 
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But  8%  =  (8&  +  Sd)  (Sc  +  Bd), 

0-»,^0'>,,1\n,  =  O-bO-cVbc  +  <Tb(T lined  +  (TeO-d  Ted  +  Cr/, 

ad  —  he  /         •  \ 

=     N (xxxvi.), 

therefore 

ad  —  he  . 

O-hO-kThk  =    -j^jjg-  (XXXVII.). 

ad  —  he  ,         ...  . 

n/.  =  ,   ,w    ,  ,  ,   (xxxvm.). 

This  is  an  important  result ;  it  expresses  the  correlation  between  errors  in  the 
position  of  the  means  of  the  two  characters  under  consideration.  But  if  the  prob- 
abilities were  independent  there  could  be  no  such  correlation.  Thus  r/^^  might  be 
taken  as  a  measure  of  divergence  from  independent  variation.  We  shall  return  to 
this  point  later. 

Since  Sn^  =  —  HN8A,  we  have  hn^hd  =  —  HN8c/S/i,  whence  we  easily  deduce 

=  —Tdh  (xxxix.). 

Similarly  r^n,  =  —         .    .'   .  (xl.). 

Now     is  a  function  of  r,  h,  and  k.    Hence  if  d  =  f{r,  h,  k), 

dr            dh  dk 
=  yo^^  +  yi8/i  +  y^Bk  (xH.). 

Whence  transposing,  squaring,  summing,  and  dividing  by  the  total  number  of 
observations,  we  find 

y^ar^  =  a-/  +  yiV/,^  +  ya^V/  —  2y^(Ta(rurak  —  ^y^adO-kTM 

+  ^y^y^cr/.o-kT/d 

+  2(^)  <rd  <Tn.,  'V,,  +  1^  cr,,  o-,,n,,  (xlii.). 

Substituting  the  values  of  the  standard  deviations  and  correlations  as  found  above, 
we  have 

<^?  =  ^.[  d(a  +  6  +  c)  +  (^)'(«  +  6)  (cJ  +  c)  +  {^^"j  (a  +  c)  {d  +  h) 

+  he)  +  ^d{b  +  a)+l^d(c  +  a)]  .    .    .  (xliii.). 
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It  remains  now  to  determine  Jq,  y^,  and  72- 
By  Equation  (i.) 

d  =f(r,  h,  k)  =  a^^f— J"  [y^^^^i^dxdy, 

=  —7. — Tr=^.^         ^  ' ay 


=  -  H  ^  ^e-'^^  dz  (xliv.), 

where  ^2  =  ^^^^=2  • 

Thus 

=  V'2-i  '  ■  ■  (xiv.). 

Similarly  y2/(NK)  =  i//,  - 1  ......        .    .  (xlvi.). 

1    ri^i  1 
Here  =  — 7^--     e~^"'c?2;,  i//2=— 7^     e~^''c?2;  (xlvii,), 

V'  ^TT  Jo  V  -^T''  Jo 

where  ^1  =  y^^,       ^3  =  ^^=1  (xlviii.), 

and  thus  and  can  be  found  at  once  from  the  tables  when  and  ^.^  are  found 
from  the  known  values  of  r,  h,  k. 

Lastly,  we  have  from  Equation  (xxi.) 


ad  —  be 
N^HK 


Jo 


d  (d  +  b){d+c)     ,       1  rTT7 

N  =  — w —  +  2;;]o™'- 

Thus*  yo  =  cZ//cZr  =  ^  NU, 

yo/N  =  xo> 

where  =  ^  -7==.  e-2-(r^(''^  +  ^'^-^'-^^)   (xlix.) 

a  value  which  can  again  be  found  as  soon  as  r,  h,  h  are  known,        =  ^qN  is  clearly 
the  ordinate  of  the  frequency  surface  corresponding  to  x  =  h,  y  =  k. 
Substituting  in  Equation  (xliii.)  we  have,  after  some  reductions, 

*  By  Equations  (ii.)  and  (iii.),  d  +  b  and  d  +  c  are  independent  of  ?'. 
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Probable  error  of  r  =  '674490-,. 
•67449  J  (g  +  d)(c-i  b)  ^  ^1  3  {a  +  c){d+  h)  ,    ,  ^  (i^  +  c) 


V^N^  1         4N3         +"^-2  +V  ^^ 

+  2^//,./,,  —  t/;^  -^^^  I  ^  (l), 

where  Xoj  '/'u  ^-i^d  i//^  are  readily  found  from  Equations  (xlix.),  (xlvii.),  and  (xlviii.). 
Thus  the  probable  error  of  r  can  be  fairly  readily  found.  It  must  be  noted  in  using  this 
formula,  that  a  is  the  quadrant  in  which  the  mean  falls,  so  that  h  and  k  are  both 
positive  (see  fig.,  p.  2).  In  other  words,  we  have  supposed  a  ■\-  c>  h  +  d  and 
a  -\-  h  >  c,  -\-  d.  Our  lettering  must  always  be  arranged  so  as  to  suit  this  result 
before  we  apply  the  above  formula. 


§  (5.)  To  Find  a  Physical  Meaning  for  the  Series  in  r,  or  for  the  e  of  Equation  (xxi.). 

Return  to  the  original  distribution  ~rj^  of  p.  2.  If  the  probahilities  of  the  two 
characters  or  organs  were  quite  independent,  we  should  expect  the  distribution 

a  +  I  h  -\-  d 


^  a  +  b  a  +  c 


^  c  +  d  a  +  c 


N 


N  N 


c  +  d  b  +  d 


Now  re-arranging  our  actual  data  we  may  put  it  thus  : 


^a  +  ba  +  c      ad  —  be 
a  \  b  _      ~N       1^  N 


d        ^  c  +  d  a  +  c       ad  —  be 


a  +  b  b  +  d      ad  —  be 


■^e  +  db  +  d      ad  —  be 


Accordingly  correlation  denotes  that  "'^-^     has  been  transferred  from  each  of  the 

second  and  fourth  compartments,  and  the  same  amount  added  to  each  of  the  first  and 
third  compartments.  If  =  (acZ  —  &c)/N^,  then  iq  is  the  transfer  per  unit  of  the  total 
frequency.  The  magnitude  of  this  transfer  is  clearly  a  measure  of  the  divergence  of 
the  statistics  from  independent  variation.  It  is  physically  quite  as  significant  as  the 
correlation  coefiicient  itself,  and  of  course  much  easier  to  determine.  It  must  vanish 
with  the  correlation  coefficient.    We  see  from  (xxi.)  that 

>7  =  e  X  HK, 

or  we  have  an  interpretation  for  the  series  in  r  of  (xix,). 

Now,  obviously  any  function  of  rj,  just  like  r)  itself,  would  serve  as  a  measure  oi 
the  divergence  from  perfectly  independent  variation.  It  is  convenient  to  choose  a 
function  which  shall  lie  arithmetically  between  0  and  1. 
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Now  consider  what  happens  in  the  case  of  perfect  correlation,  i.e.,  all  the  observa- 
tions fall  into  a  straight  line.  Hence  if  ad  >  he,  either  &  or  c  is  zero,  for  a  straight 
line  cannot  cut  all  four  compartments,  and  a  and  d  are  obviously  positive.  Thus  c 
and  h  can  only  be  zero  if  77  =  (c  +  d){a  +  c)/W  or  {a  +  h){b  +  d)JW.  In  order 
that  h  should  be  zero,  it  is  needful  that  h  and  k,  as  given  by  (iv.)  and  (v.),  should  be 
positive  or  a  +  c  >  h  +  d,  a  +  b  >  c  -\-  d,  and  the  mean  fall  under  the  45°  line 
through  the  vertical  and  horizontal  lines  dividing  the  table  into  four  compartments, 
i.e.,  h  >  k  These  conditions  would  be  satisfied  it  ad  >hc  and  a  >  d,  c  >  h.  Now 
suppose  our  four- compartment  table  arranged  so  that 

ad>hc,    a>d,  c>h, 

and  consider  the  function 
or 

Q.    TT       ad  —  be  /v  \ 

1  =  sm  -   (ill-)- 

This  function  vanishes  \{  rj  =  0,  and  it  further  =  unity  if  6  =  0,  Thus  it  agrees 
at  the  limits  0  and  1  with  the  value  of  the  correlation  coefficient.    Again,  when  h 

and  k  are  both  zero,  a  =  d,  h  =  c,  and  Q-,  =  sin  y  ^  _^  ^,  is  thus  r  by  (xx.).  Hence 

we  have  found  a  function  which  vanishes  with  r  and  equals  unity  with  r,  while  it  is 
also  equal  to  r  if  the  divisions  of  the  table  be  taken  through  the  medians. 

Now,  I  take  it  that  these  are  very  good  conditions  to  make  for  any  function  01 
a,  h,  c,  d  which  is  to  vanish  with  the  "  transfer,"  and  to  serve  as  a  measure  of  the 
degree  of  dependent  variability,  or  what  Mr.  Yule  has  termed  the  degree  of 
"  association."    Mr.  Yule  has  selected  for  his  coefficient  of  association  the  expression 

„       ad  —  be  > 

Q'  =  ST^  ('"■•)• 

This  vanishes  with  the  transfer,  equals  unity  if  6  or  c  be  zero,  and  minus  unity  if  a 
or  d  be  zero.  The  lafcter  is,  of  course,  unnecessary  if  we  agree  to  arrange  a,  5,  c,  d 
so  that  ad  is  always  greater  than  he.  Now  it  is  clear  that  Qo  possesses  a  great 
advantage  over  in  rapidity  of  calculation,  but  the  coefficient  of  correlation  is  also 
a  coefficient  which  measures  the  association,  and  it  is  a  great  advantage  to  select  one 
which  agrees  to  the  closest  extent  with  the  correlation,  for  then  it  enables  us  to 
determine  other  important  features  of  the  system. 

If  we  do  not  make  all  the  above  conditions,  we  easily  obtain  a  number  of  coeffi- 
cients which  would  vanish  with  the  transfer.  Thus  for  example  the  correlation  r^^.  of 
Equation  (xxxviii.)  is  such  an  expression.'^  It  has  the  advantage  of  a  symmetrical 
form,  and  has  a  concise  physical  meaning.    It  does  not,  however,  become  unity  when 

*  In  fact  (xxxvii.)  gives  us  e  =  o-ho-krhk- 
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either,  but  not  both,  h  and  c  vanish,  nor  does  it,  unless  we  multiply  it  by  7r/2  and 
take  its  sine,  equal  the  coefficient  of  correlation  when  a  =  cl  and  b  =  c. 

Again,  we  might  deduce  a  fairly  simple  approximation  to  the  coefficient  of  correla- 
tion from  the  Equation  (xxiv.)  for  9,  using  only  its  first  few  terms.    Thus  we  find 

Q-    £>  ad  —  be  n-  \ 

N^l  -  K%x^  +  %/)}  +  ■^X.XMI  -  be)    •    ■    ■    '    '  Uiv-)' 


,  /tt  (a  +  c)  -(b  +  d) 

where  ;Yi  =  \/ 

X2  =  -x/ 


2  N 

TT  (rt  +  b)  —  (c  +  d) 


as  an  expression  which  vanishes  with  the  transfer,  and  will  be  fairly  close  to  the 
coefficient  of  correlation.    It  is  not,  however,  exactly  unity  when  either  6  or  c  is 
zero.    But  without  entering  into  a  discussion  of  such  expressions,  we  can  write 
several  down  which  fully  satisfy  the  three  conditions : — 
(i.)  Vanishing  with  the  transfer. 

(ii.)  Being  equal  to  unity  if  6  or  c  =  0. 

(iii.)  Being  equal  to  the  correlation  for  median  divisions. 
Such  are,  for  example  : — 

=       2  ^ad  -  ^be 

TT  1 

Q4  =  sin2f:f^  2Sc  ^,ad>lc    .    .    .  (Ivi.), 

(ad  —  be)  (b  +  c) 

Q-=^^^f7rT^  •  •  ■ 

„  4:abcdW 
where  k      ■   • 


(ad  -  hcf  (a  +  d)  (b  c) 


Only  by  actual  examination  of  the  numerical  results  has  it  seemed  possible  to  pick 
out  the  most  efficient  of  these  coefficients.  was  found  of  little  service.  The 
following  table  gives  the  values  of  Q.,,  Qg,  Q^,  and  Q5  in  the  case  of  fifteen  series 
selected  to  cover  a  fairly  wide  range  of  values  : — 


TO  THE  THEORY  OF  EVOLUTION. 


17 


No. 

r. 

h. 

n 

Hi- 

1 

•5939  + 

•0247 

■0873 

— 

•4163 

•7067 

•6054 

■6168 

•6100 

2 

•5557  ± 

•0261 

•4189 

— 

•4163 

•6688 

•5657 

■5405 

•5570 

3 

•5529  + 

•0247 

•0873 

— 

•0012 

•6828 

•5809 

■5699 

•5813 

4 

■5264  + 

•0264 

+ 

•2743 

+ 

•3537 

•6345 

•5331 

■5200 

•5283 

5 

•5213  ± 

•0294 

+ 

•6413 

+ 

•6966 

•6530 

•5511 

•4878 

•5160 

6 

■5524  ± 

•0307 

+ 

1^0234 

+ 

■3537 

■7130 

•6118 

•6169 

•6138 

7 

•5422  + 

•0288 

+ 

■6463 

+ 

■5828 

■6693 

•5673 

•5136 

•5452 

8 

•2222  + 

•0162 

+ 

■3190 

+ 

■3190 

■2840 

•2268 

•2164 

•2251 

Q 

y 

•3180  + 

•0361 

+ 

■1381 

+ 

•0696 

■3959 

■3185 

•3176 

•3183 

10 

■5954  ± 

•0272 

+  1-5114 

+ 

•7^14 

■7860 

■7100 

•6099 

■6803 

11 

■4708  ± 

■0292 

+ 

•0865 

•0054 

■5692 

•4712 

•4720 

•4715 

12 

■2335  + 

■0335 

+ 

•0405 

+ 

•0054 

■2996 

•2385 

•2385 

•2385 

13 

■2451  ± 

■0205 

+ 

•2707 

+ 

•0873 

■3103 

•2473 

•2456 

•2470 

14 

•1002  ± 

■0394  ' 

+ 

•4557 

+ 

•1758 

•1311 

•1032 

•0993 

•1029 

15 

■6928  ± 

■0164 

+ 

•5814 

+ 

•5814 

•8032 

•7108 

•6699 

■6897 

Now  an  examination  of  this  table  shows  that  notwithstanding  the  extreme  ele- 
gance and  simplicity  of  Mr.  Yule's  coefficient  of  association  Qo,  the  coefficients  Q3, 
Qi,  and  Q5,  which  satisfy  also  his  requirements,  are  much  nearer  to  the  values 
assumed  by  the  correlation.  I  take  this  to  be  such  great  gain  that  it  more  than 
counterbalances  the  somewhat  greater  labour  of  calculation.  If  we  except  cases  (6) 
and  (10),  in  which  h  or  k  take  a  large  value  exceeding  unity,  we  find  that  Q3,  Q^,  and 
Q5  in  the  fifteen  cases  hardly  differ  by  as  much  as  the  probable  error  from  the  value 
of  the  correlation.  If  we  take  the  mean  percentage  error  of  the  difference  between 
the  correlation  and  these  coefficients,  we  find 

Mean  difference  of  Q3  =  24 "38  per  cent. 
.  jj         55      Q3  —    3'95  ,, 
J)       ■  )>  —    2  94 

3J         ))      Qj  —   2'72  ,, 

Thus  although  there  is  not  much  to  choose  between  and  Q5,  we  can  take  Q5  as 
a  good  measure  of  the  degree  of  independent  variation. 

The  reader  may  ask  :  Why  is  it  needful  to  seek  for  such  a  measure  ?  Why  cannot 
we  always  use  the  correlation  as  determined  by  the  method  of  this  paper  ?  The 
answer  is  twofold.  We  want  first  to  save  the  labour  of  calculating  r  for  cases  where 
the  data  are  comparatively  poor,  and  so  reaching  a  fairly  approximate  result  rapidly. 
But  labour-saving  is  never  a  wholly  satisfactory  excuse  for  adopting  an  inferior 
method.  The  second  and  chief  reason  for  seeking  such  a  coefficient  as  Q  lies  in  the 
fact  that  all  our  reasoning  in  this  paper  is  based  upon  the  normality  of  the  frequency. 
We  require  to  free  ourselves  from  this  assumption  if  possible,  for  the  difficulty,  as 
is  exemplified  in  Illustration  V.  below,  is  to  find  material  which  actually  obeys 
within  the  probable  errors  any  such  law.  Now,  by  considering  the  coefficient  of 
regression,  ra-Ja-^^  =  S{xy)/(Naia-2),  as  the  slope  of  the  line  which  best  fits  the  series 
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of  points  determined  as  the  means  of  arrays  of  x  for  given  values  of  y,  we  have  once 
and  for  all  freed  ourselves  from  the  difficulties  attendant  upon  assuming  normal 
frequency.  We  become  indifferent  to  the  deviations  from  that  law,  merely  observing 
how  closely  or  not  our  means  of  arrays  fall  on  a  line.  When  we  are  not  given  arrays 
but  gross  grouping  under  certain  divisions,  we  have  seen  that  the  "  transfer  "  is  also 
a  physical  quantity  of  a  significance  independent  of  normality.  We  want  accordingly 
to  take  a  function  which  vanishes  with  the  transfer,  and  does  not  diverge  widely 
from  the  correlation  in  cases  that  we  can  test.  Here  the  correlation  is  not  taken  as 
something  peculiar  to  normal  distributions,  but  something  significant  for  all  distribu- 
tions whatever.    Such  a  function  of  a  suitable  kind  appears  to  be  given  by  Qg. 

§  6.  On  the  "  Excess  "  and  its  Relation  to  Correlation  and  Relative  Variability. 

There  is  another  method  of  dealing  with  the  correlation  of  characters  for  which 
we  cannot  directly  discover  a  quantitative  scale  which  deserves  consideration.  It 
is  capable  of  fairly  wide  application,  but,  unlike  the  methods  previously  discussed,  it 
requires  the  data  to  be  collected  in  a  special  manner.  It  has  the  advantage  of  not 
applying  only  to  the  normal  surface  of  frequency,  but  to  any  surface  which  can  be 
converted  into  a  surface  of  revolution  by  a  slide  and  two  stretches. 

It  is  well  known  that  not  only  the  normal  curve  but  the  normal  surface  has  a 
type  form  from  which  all  others  can  be  deduced  by  stretching  or  stretching  and 
sliding.  Thus  in  1895  the  Cambridge  Instrument  Company  made  for  the  instrument 
room  at  University  College,  London,  a  "  biprojector,"  an  instrument  for  giving 
arbitrary  stretches  in  two  directions  at  right  angles  to  any  curve.  In  this  manner 
by  the  use  of  type-templates  we  were  able  to  draw  a  variety  of  curves  with  arbi- 
trary parameters,  e.g.,  all  ellipses  from  one  circle,  parabolas  from  one  parabola, 
normal  curves  from  one  normal  curve  template.  Somewhat  later  Mr.  G.  U.  Yule 
commenced  a  model  of  a  normal  frequency  surface  on  the  Brill  system  of  inter- 
laced curves.  This,  by  the  variable  amount  of  slide  given  to  its  two  rectangular 
systems  of  normal  curves,  illustrated  the  changes  from  zero  to  perfect  correlation. 
This  model  was  exhibited  at  a  College  soiree  in  June,  1897.  Geometrically  this 
property  has  been  taken  by  Mr.  W.  F.  Sheppard  as  the  basis  of  his  valuable  paper 
on  correlation  in  the  '  Phil.  Trans.,'  A,  vol.  192,  pp.  101-167.  It  is  a  slight  addition 
to,  and  modification  of,  his  results  that  I  propose  to  consider  in  this  section. 

The  equation  to  the  normal  frequency  surface  is,  as  we  have  seen  in  §  1, 

Now  write  x/[a-i\/l  —  r^)  =  x' ,  y/a^  =  y'.  This  is  merely  giving  the  surface  two 
uniform  stretches  (or  squeezes)  parallel  to  the  coordinate  axes.  We  have  for  the 
frequency  of  pairs  lying  between  x,  x      Sx,  and  y,  S  S//, 
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zBxSy  =  2^  Bx'Sy'  expt. 


X  — 


v/1 


Now  give  the  uniform  slide  parallel  to  the  axis  of  x  defined  byr/^1— 

at  unit  distance  from  that  axis.  This  will  not  change  the  basal  unit  of  area 
8a  =  Sx'Sy',  and  analytically  we  may  write 


X  =  x'  -  y'r/x/l -r\       Y  =  y\  ^      +  Y^. 


Whence  we  find 


zhxhy  =      8a  expt.  ( — 


This  is  the  mechanical  changing  of  the  Yule-Brill  model  analytically  represented. 
The  surface  is  now  one  of  revolution,  and  the  proof  would  have  been  precisely  the 
same  if  we  had  written  in  the  above  results  any  function  /,  instead  of  the  exjDo- 
nential.'^  It  is  easy  to  see  that  any  volume  cut  off  by  two  planes  through  the  axis  of 
the  surface  is  to  the  whole  volume  as  the  angle  between  the  two  planes  is  to  four  right 
angles.  Further  the  corresponding  volumes  of  this  surface  and  the  original  surface 
are  to  each  other  as  unity  to  the  product  of  the  two  stretches.  Lastly,  any  plane 
through  the  z-axis  of  the  original  solid  remains  a  plane  through  the  z-axis  after  the 
two  stretches  and  the  slide.  These  points  have  all  been  dealt  with  by  Mr.  Sheppard 
(p.  101  et  seq.,  loc.  cit.).  I  will  here  adopt  his  notation  r  =  cosD,  and  term  with  him 
D  the  divergence.  Thus  cot  D  is  (in  the  language  of  the  theory  of  strain)  the  slide, 
and  D  is  the  angle  between  the  strained  positions  of  the  original  x  and  y  directions. 
Now  consider  any  plane  which  makes  an  angle  x  with  the  plane  of  xz  before  strain. 
Then,  since  the  contour  lines  of  the  correlation  surface  are  ellipses,  the  volumes  of 
the  surface  upon  the  like  shaded  opposite  angles  of  the  plan  diagram  below  will  be 
equal ;  and  if  they  be  and  Wo,  then  -f-  7^  =  ^N.  If  n^'  and  be  the  volumes 
after  strain,  then  by  what  precedes  we  shall  have 


=  o■^(T^^y  I  —      X  n/, 


He 


=  o-iO-2V^  1  —     X  n^, 


and 


-  n,)/{n,  +  n,)  =  «  -  n,')/{n{  +  n^). 


-X 


*  The  generalisation  is  not  so  great  as  might  at  first  appear,  for  I  have  convinced  myself  that  this 
property  of  conversion  into  a  surface  of  revolution  by  stretches  and  slides  does  not  hold  for  actual  cases 
of  markedly  skew  correlation. 

D  2 
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Now  n{  and  will  be  as  the  angles  between  the  strained  positions  of  the  planes 
bounding  n-^  and  n^.  Ox  does  not  change  its  direction.  Oi/  is  turned  through  an 
angle  7r/2  —  D  clockwise,  and  x  becomes  x'\  say.  Hence 

n,  :  <  : :  -  _  X '  +  ^  -  D  :  2  +  X  -  2  + 
«  -  O/K  +  <)  =  I       +  D)  -  1. 


or 


Let  us  write  =  2(%  —  n{}  and  term  it  the  excess  for  the  y-character  for  the 
line  AB,    Then  we  easily  find  : 

/E^  TT     7r\  /  '/  I  -r\\       cot  y"  +  cot  D  ...  . 

tan  [^2+2)  =  ^'^  (X  +  D)  =      ^.      p  _  1     ....  (l™.). 

It  remains  to  determine  tan  x'  and  substitute.  The  stretches  alter  tan  x  into 
tan  x',  such  that 


tan  x'  =        r  tan  x- 


Further,  by  the  slide 


cot  v"  =  cot  y'  —  cot  D  =   ^       cot  y  —  cot  D. 

Hence  we  have  by  (Iviii.)  above 

-      (n  t)  =  a^f^^  ""^^  X  cot  D  -  cot^  D  -  1)  , 

or, 

-tan(|f)  =  ootD-^i*^  (lix.). 

Now  the  excess  is  the  difference  of  the  frequencies  in  the  sum  of  the  strips  of 
the  volume  made  by  planes  parallel  to  the  jDlane  yz  on  the  two  sides  of  the  plane  KEz 
(defined  by  x),  taken  without  regard  to  sign.  For  on  one  side  of  the  mean  yy  this  is 
^2  —  and  on  the  other  —[ui  —  no^.  Hence  we  have  this  definition  of  E^,  the 
column  excess  for  any  line  through  the  mean  of  a  correlation  table  :  Add  up  the 
frequencies  above  and  below  the  line  in  each  column  and  take  their  differences  ivithout 
regard  to  sign,  and  their  sum  is  the  column  excess. 

If  we  are  dealing  with  an  actual  correlation  table  and  not  Math  a  method  of 
collecting  statistics,  then  care  must  be  taken  to  properly  proportion  the  frequencies 
in  the  column  in  which  the  mean  occurs,  and  also  in  the  groups  which  are  crossed  by 
the  line.  It  is  the  difficulty  of  doing  this  satisfactorily,  especially  if  the  grouping,  as 
in  eye  and  coat  colour,  is  large  and  somewhat  rough,  that  hinders  the  effective  use  of 
the  method,  if  the  statistics  have  not  been  collected  ad  hoc. 

Now  let  Eo  be  the  row  excess  for  the  line  AB,  defined  in  like  manner,  then  we  have 
in  the  same  way 
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-ta.(||)  =  ootD-=;:|^  (Hx.«.). 

Now  eliminate  cr^/a-i  between  (lix.)  and  (lix.^'^)  ;  then 

(tan  (I  I)  +  cot  D)  (tan  (|  |)  +  cot  D)  =  .-j^  • 


Whence  we  deduce 
and,  therefore, 


cot  1)  =  cot   -  , 

=  COS  D  =  COS  "^^  ^  "^^  ^  .    .    .  ■  (Ix-)- 


Substituting  for  D  in  (hx.)  we  find  further 


^  =  cotxcos(||V-s(^M  (Ixi.). 


N  2  11  ""^VN  2 

Thus  Equations  (Ix.)  and  (Ixi.)  give  the  coefficient  of  correlation  and  the  relative 
variability  of  the  two  characters.  The  latter  is,  I  believe,  quite  new,  the  former  novel 
in  form. 

If  we  call  the  frequency  in  the  angle  x  (AOx  of  the  figure  above),  then  it  is  easy 
to  see  that  =  2{n.-^  —  i\)  =  N  —  4.ny,  and  similarly  E^  =  N  —  4wj.  Thus 
(E^  +  E.)/N  =  2(N  —  2(i2i  +  Wi))/N.  But  +  is  the  frequency  in  the  first 
quadrant.  This  Mr.  Sheppard  terms  P,  while  that  in  the  second  he  terms  R.  We 
have  thus  (E^  +  Ej/N  =  2R/(R  +  P),  or 

R 

i.e.,  Mr.  Sheppard's  fundamental  result*  ('  Phil.  Trans.,'  A,  vol.  192,  141). 

We  can,  of  course,  get  Mr.  Sheppard's  result  directly  if  we  put  x  =  when  we 
have  at  once  E^  =  2(R  —  P),  Eg  =  N  =  2(R  +  P),  and  the  result  follows. 

Equation  (Ixi.)  may  also  be  written  in  the  form 

^  =  cotxsin('^^27r)/sin('^27r)  (Ixiii.). 

If  we  put  X  =  0,  then  becomes  zero,  and  the  right-hand  side  of  (Ixiii.)  is 
indeterminate.  If  we  proceed,  however,  to  the  limit  by  evaluating  the  frequency  in 
an  indefinitely  thin  wedge  of  angle  X;  we  reach  merely  the  identity  cr^/a-j  =  o'l/o's- 
Hence  there  is  no  result  corresponding  to  (Ixi.)  to  be  obtained  by  taking 
Mr.  Sheppard's  case  of  x  =  0. 

The  following  are  the  values  of  the  probable  errors  of  the  quantities  involved : — 


*  In  the  actual  classification  of  data  (Ix.)  and  (Ixii.)  suggest  quite  different  processes.  We  can  apply  (Ix.) 
where  (Ixii.)  is  difficult  or  impossible,  e.g.,  correlation  in  shading  of  birds'  eggs  from  the  same  clutch. 


22 


PROFESSOR  K.  PEARSON  ON  MATHEMATICAL  CONTRIBUTIONS 


Probable  error  of  El  = -67449  yN(l  —  E^^N^)  ....  (Ixiv.). 

E,  =  -67449 VN(1  -E^VN^)  .    .    .    .  (Ixv.). 

Correlation  between  errors  in  E,  and  E,  =  -  \/^^~  ^i/^)^^  ~  gi^^)  nxvi  \ 

^         ^          V  (1  +  E,/N)  (1  +  E./N)  •    •  V^^^'h 


-p    ,   , ,            .           -67449  sin  Dyj5  (tt  -  D)  ..  , 

Jrrobabie  error  m  r  —   ....  (Ixvii.), 

E    -f-  E  TT 

where  D  =   ^      ^  -  (c/  Sheppard,  Zoc.  c^^.,  p.  148). 

Probable  error  in  ratio  crjcr^  = 

•67449  <T,7r\f  ,,%7r\   ,   /       E,2\^  ^/E^ttX 

2        -  N^j*'^^^  (n  27  +      -  N;)        (n  2) 

The  ajDplication  of  the  method  here  discussed  to  statistics  without  quantitative 
scale  can  now  be  indicated.  If  the  characters  we  are  dealing  with  have  the  same 
scale,  although  it  be  unknown,  then,  if  the  quantitative  order  be  maintained,  i.e., 
individuals  arranged  in  order  of  lightness  or  darkness  of  coat  or  eye-colour,  the 
diagonal  line  on  the  table  at  45°  will  remain  unchanged,  however  we  may  suppose 
parts  of  the  scale  to  be  distorted,  for  the  distortion  will  be  the  same  at  corresponding 
points  of  both  axes.  Further,  if  we  suppose  the  mean  of  the  two  characters  to  be  the 
same,  this  45°  line  will  pass  through  that  mean,  and  will  serve  for  the  line  AB  of  the 
above  investigation.  In  this  case  we  must  take  tan  ^  =  1,  and  consequently  (Ixi.) 
becomes 

aja,  =  cos  (I  f )  /cos  (I  ^)  (Ixix.). 


N  2j/  ""^\N  2 

We  can  even,  when  the  mean  is  a  considerable  way  off  the  45°  line,  get,  in  some 
cases,  good  results.  Thus,  the  correlation  in  stature  of  husband  and  wife  worked  out 
by  the  ordinary  product  moment  process  is  "2872.  But  in  this  case  E^  =  382-062 
E^  =  806-425,  and  this  gives  the  correlation  -2994.  On  the  other  hand,  the  actual 
ratio  of  variabilities  is  1-12,  while  (Ixix.)  makes  it  2*76  !  This  arises  from  the  fact 
that  the  errors  in  E^  and  Eo,  due  to  the  mean  being  off  the  45°  line,  tend  to  cancel  in 
E^  +  E^,  but  tend  in  directly  opposite  directions  in  the  ratio  of  the  cosines.  Similarly 
the  correlation  between  father  and  son  works  out  "5666,  which  may  be  compared  with 
the  values  given  in  Illustration  V.  below,  ranging  from  -5198  to  -5939.  Again, 
correlation  in  eye-colour  between  husband  and  wife  came  out  by  the  excess  process 
-0986,  and  by  the  process  given  earlier  in  the  present  Memoir  -1002.  But  all  these  are 
favourable  examples,  and  many  others  gave  much  worse  results.  We  ought  really  only 
to  apply  it  to  find  cr^/o-^  when  the  means  are  on  the  45°  line,  as  in  the  correlation  of  the 
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same  character  in  brethren,  and  even  in  this  case  the  statistics  ought  to  be  collected 
ad  hoc,  i.e.,  we  ought  to  make  a  very  full  quantitative  order,  and  then  notice  for  each 
individual  case  the  number  above  and  below  the  type.  For  example,  sujjpose  we  had 
a  diagram  of  some  twenty-five  to  thirty  eye  tints  in  order  {e.g.,  like  Bertrand's), 
then  we  take  any  individual,  note  his  tint,  and  observe  how  many  relatives  of  a 
particular  class — brethren  or  cousins,  say — have  lighter  and  how  many  darker 
eyes  ;  the  difference  of  the  tM^o  would  be  the  excess  for  this  individual.  The  same 
plan  would  be  possible  with  horses'  coat-colour  and  other  characters.  After  trying  the 
plan  of  the  excesses  on  the  data  at  my  disposal  for  horses'  coat-colour  and  human  eye- 
colour  (which  were  not  collected  ad  hoc),  I  abandoned  it  for  the  earlier  method  of 
this  Memoir  ;  for,  the  classification  being  in  large  groups,  the  proportioning  of  the 
excess  (as  well  as  the  differences  in  the  means)  introduced  too  great  errors  for  such 
investigations. 

§  7.  On  a  Generalisation  of  the  Fundamental  Theorem  of  the  Present  Memoir. 


If  we  measure  deviations  in  units  of  standard  deviations,  we  may  take  for  the 
equation  to  the  correlation  surface  for  n  variables 


,-,(s.(|w).2s.(?^)}  


where 


1 

r 


ro-x 


''•2)1 


32 


3n 


■^"vi— 1,  2      '^'n-l,  3 


n 


n,  I 


r,,  o 


11,  3 


•  1  r,,_^,n 


and  R^,^  is  the  minor  obtained  by  striking  out  the  pth  row  and  qih.  column,  r^^  is,  of 
course,  the  correlation  between  the  pih  and  5th  variables,  and  equals  r,jp.  denotes 
a  summation  for  s  from  1  to  w,  and  a  summation  of  every  possible  pair  out  of  the 
n  quantities  1  to  n. 

Now  take  the  logarithmic  differential  of  z  with  regard  to  r^^.    We  find 


1 

z  dr,, 


n 


\dr.^ 


T'l 


For 


d^ldrp^  =  2E, 


'pq 
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and,  generally",  whether  s  is  or  is  not  =  s',  or  these  are  or  are  not  =  p  and  q,  we 
have 

(I  /  l^s'sA               l^/i'  "t"  R^w  R^i  ,-.    .  » 

^,Uy "  —  

This  follows  thus  : 

d  I  Rjj/^       1  (j^Rjji/      Ris(  f?R       1  c/Rji;  2R^._5,  R^^ 


6^?"^,^  \  11  J       R  R~  clvpj       R  R^ 


or  we  have  to  show  that 


d^SSl  2Rjj;  R/li  Rji'  R/)S'  RjJ 

Rjj'  1^;)?     -Rpi  Ry*'  I  Rjj'  R/)y     l^/js'  Ryi 
R  R 


where  ^.^11^,'  is  the  minor  corresponding  to  the  term  r^^^  in  E,„,,  and  f,j^ss'  the  minor 
corresponding  to  the  term  Vf^p*  But  this  last  result  is  obvious  because  E,jy  only  con- 
tains Vp^  in  two  places,  i.e.,  as  r^q  and  r^p. 

Putting  s  =s',  we  have  the  other  identity  required  above,  i.e., 


d  I RiS \  2Rps  Rys 


di-jic^  \  R  /  R^ 


1  dz 

Returning  now  to  the  value  for  -  — -  on  the  previous  page,  we  see  that  the  two 


sum  terms  may  be  expressed  as  a  product,  or  we  may  put 


Idz^ 
z  dr 


n 


Now  write 


N 

z  = 


(27r)"VR  ■ 

di^       ^  /lips    \      d<f)         „  /R,js    \        ,      d^^    Rpy 


^=S,(t.,).    *  =  S,(t..)  and 
1  dz    d~(j)    ^  d(f)  d^ 

Now  differentiate  log  z  with  regard  to  x^.  Then 

dz  d(f) 


dxp  ci  ti'q 


*  See  also  Scott,  '  Theory  of  Determinants,'  p.  59. 
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dxpdx^  dxpdxg       dxj  dxj, 

f?V    _^  d^d^ 


Thus  finally 


z  dXpdXg  dxpdxj       dXp  dx^ 

dz  dH 


(Ixxii.). 


In  other  words,  the  operator  cZ/c?r^,/  actmg  on  z  can  always  be  replaced  by  the 
operator  d^jdxpdxq. 

Let  d/dpp^  denote  the  elFect  of  applying  the  operator  djdr^  to  z,  and  putting  Vpr, 
zero  after  all  differentiations  have  been  performed,  then  the  effect  of  this  operator  will 
be  the  same  as  if  we  used  d^jdxpdxg  on  z,  putting  r^,/  zero  before  differentiation. 
Generally,  let  F  be  any  series  of  operations  like  d/dvpri,  then  we  see  that 

^  I  d        d  d 
F    — . 


d-Tpq        dVjiiq,  dVpiiqii 

\  dxj/dx^    dxpdxi^,  '     dx,p„dx,p,  ' /  (27r)'" 

Now  let  F  be  the  function  which  gives  the  operation  of  expanding  z  by  Maclaurin's 
theorem  in  powers  of  the  correlation  coefficients,  i.e., 

F  =  .^-'^£7). 

then 

z  =  e  ^^(•"  i')  z  =  .T^)  e  - *«'(-^> . 

This  is  the  generalised  form  of  result  (xiv.)  reached  above. 
Now  let  2o  =  e-i^'<--^'\ 

then  Zq  is  the  ordinate  of  a  frequency  surface  of  the  7v,th  order,  in  which  the  distribution 
of  the  n  variables  is  absolutely  independent.  "We  have  accordingly  the  extremely 
interesting  geometrical  interpretation  that  the  operator 

applied  to  a  surface  of  frequency  for  n  independent  variables  converts  it  into  a  surface 
of  frequency  for  n  dependent  variables,  the  correlation  between  the  sth  and  s'th 
variables  being  r^^.* 

*  I  should  like  to  suggest  to  the  pure  mathematician  the  interest  which  a  study  of  such  operators  would 
have,  and  in  particular  of  the  generalised  form  of  projection  in  hyperspace  indicated  by  them. 
VOL.  CXCV. — A.  E 
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ExjDanding,  we  have 


«  =  ^0  +  H  ^rs'  j^O  +        i  S3  (  9  V 


1 


+  •  •  •  +  ^0  +  (Ixxiii.). 


Our  next  stage  is  to  evaluate  the  operation 


Let  us  put 


dxM'j  °" 


and      =  the  ^th  function  of         defined  by  (xv.). 

Let     be  a  symbol  such  that  e/  represents  />^p-    Then  we  shall  show  that 


A''''  c/x/L'-J  "0 


We  shall  prove  this  by  induction. 
By  (xii.) 


(Ixxiv, ). 


or 

and  by  (xvi.) 


dsV„ 


^  7ll  +  \    £  » 


=  m  or 


del 


=  m  e  J 


Now,  let  X  {^s)  be  any  function  of 


=  S(A,e/), 

if  we  suppose  it  can  be  expanded  in  powers  of  e^. 
Then 

ix(e.)  =  s(A,4(e/) 


=  S(A,2e/-i) 

=  a;,S(A^  e/)  -     S(A,  e/) 

=  (aj,  -  e,)  x(e.)  (Ixxv.). 


Similarly 
Now  suppose  that 


(Ixxvi.). 
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then 


where  U  stands  for  {S^irss'^se,)}"'. 

Hence,  remembering  that  dzjdxs  =  —  ZqXs,, 


r       /  \-|m+l  /        d?\J  \ 


which  had  to  be  proved. 

But  it  is  easy  to  show  by  simple  differentiation  that 


=  2o  {So(>Ve,e,,)]^  (Ixxvii.). 


Hence  the  theorem  is  generally  true. 
Thus  we  conclude  that 


\ 


1  r  1  1 

+  .  .  .  +  -  I  S2(nye,e,,)|  +  (Ixxviii.) 


It  is  quite  straightforward,  if  laborious,  to  write  down  the  expansion  for  any  number 
of  variables. 

Now  let  Q  be  the  total  frequency  of  complices  of  variables  with  x^  lying  between 
and  oo  ,  X2  between    and  co  ,  .  .  .  x^  between  hs  and  co  ^  .  .  .  x,,  between     and  00  ; 

and  let  Qq  be  the  frequency  of  such  c^Jlnplices  if  there  were  no  correlations. 

Then 

/.QO      /.CO  f<X>  /•CO 

Q=      j...l...2:  dx^  dx.2  .  .  .  dxg  .  .  .  dx„ 

J  hi  J  hi  J  ha  J  ft,. 

Qq  ^ —  I    I  ...  I  ...  I  Zq  dx-y  dx^  .  •  .  dxg  ,  .  .  dxji 

Now  let 

S/  '^T^  ih, 

E  2 
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where 


We  have  =  NyS^yS,  .  .  A  •  •  A.  .  .  H,  .  .  H„. 

But  by  (xviii.) 


where 

and  as  above, 


=  [.V^-lJ^.^,,  (Ixxx.), 

/3s  =  ^^e-'^'^'dx.J^e-i'"'  (Ixxxi.). 

Thus 


(^2 


or 


.  .  .  H,  .  .  .  R,,I3,I3,  .  .  .  A  .  .  .  A/- 


Sl'^p'-l  s"^p"—\ 


/3.    A'  A" 


•        ......     ZqU{sVp)  dx^dx^  .  .  .  dx,  .  .  .  dx,,  =  QqUI'^ 


(Ixxxii.). 


where  n  denotes  a  product  of  ^-i'^;  for  any  number  of  -y's  with  any  s  and  p.    The  rule, 
therefore,  is  very  simple.    We  must  expand  the  value  of  z  in  ?;'s  as  given  by  (Ixxviii.) 
above,  then  the  multiple  integral  of  this  will  be  obtained  by  lowering  every  v's  right- 
hand  subscript  by  unity  (remembering  that      =1),  and  further  dividing  by  the  of 
the  left-hand  subscript.    The  general  expression  up  to  terms  of  the  fourth  order  has 
been  written  down  ;  it  involves  thirty-four  sums,  each  represented  by  a  type  term 
All  these  would  only  occur  in  the  case  of  the  correlation  of  eight  organs,  or  when  we 
have  to  deal  with  twenty-eight  coefficients  of  correlation.     Such  a  number  seems 
beyond  our  present  power  of  arithmetical  manipulation,  so  that  T  have  not  printed  the 
general  expressions.    At  the  same  time,  the  theory  of  multiple  correlation  is  of  such 
great  importance  for  problems  of  evolution,  in  which  over  and  over  again  we  have 
three  or  four  correlated  characters  to  deal  with,*  that  it  seems  desirable  to  place 
on  record  the  expansion  for  these  cases.    I  give  four  variables  up  to  the  fourth  and 
three  variables  up  to  the  fifth  order  terms.    Afterwards  I  will  consider  special  cases. 


*  In  my  memoir  on  Prehistoric  Stature  I  have  dealt  with  five  correlated  organs,  ten  coefficients.  In 
some  barometric  investigations  now  in  hand  we  propose  to  deal  with  at  least  fifteen  coefficients,  while 
Mr.  Bramley-Moore,  in  the  correlation  of  parts  of  the  skeleton,  has,  in  a  memoir  not  yet  pul)lished,  dealt 
with  between  forty  and  fifty  cases  of  four  variables  or  six  coefficients. 
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Value  of  the  Quadruple  Integral  in  the  Case  of  Four  Variables* 


1    ^^'14^^24        iv     1  ^^23^^24 


iv     1      2%*V4  //I 

^1    +  A/33;8.  J 


24     „.  -^^iv 


_\/3,^,  ^^^^  +  Ms    '  '    +  ^1/3.   ''''    +                    +  M, 

1               „  iv     I     ^M2  hs       /     "     ,         13^13        /            I              ^4      '  /" 

+  AiWi   ''^''^  ^ 

,        ^^13^'24^  iv     I                          iv.    '     I      ^^^4^24^       iv,,  " 

,     ^^'23^  ^'24  //     /"     ,     ^^Vji!       "     iv     ,        ^^Vl2^         /  // 

I       "^^34  ^2  ///     iv     ,     ^^13  ^34       ///     /     ,     £M!34_       V/,.  i 

+  '    +  A/32/34    '    '  +  A/33A  '  ' 

2^  Q/v.        3                         Q,.  2^. 

,     £!U_I34  iv     r     I     £!V34        iv     "/     1     '^^23  ^34       /'/  " 


To  simplify  the  notation,  v/,  v",  Vg"  Vg'''  have  been  used  for  i?)^,  ov,,,  ^Vg, 
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+  /3,^3/3.  +  +  ^2^3^.     '  ^ 

,     ^l£is!l4       /     I     ^^'l2^'l3^'23       /     /'     "/     I     6jVV"23        /  // 

.     6ji£l£23_       '     ///     I     ^£l£3A       '     //     I     ^''']2^^H^'24       /     '/  iv 

6ri2^'2^       /     iv     I         '12^23^24        '/  _i_  >'  III 

I     Q^^2^'23^'34    .,  '  ,  I     ^^'l3'''23''''34   ^,  '//     i     '^'l4''23''34   „.  m 

"T  a  Q  Q  o    ^\^\  o  o  o  o    ^'2,  ~r 


6^^3^V34     /  iv   I   ^^WA     III  iv   I  6ri^?V34 

/Si^o/33/3,  1  1  ^  AAA/34  '   '  ^  ^lA/SsA  ' 

.''34       //     ///     ;„     ,     ^''l3*'u''34  '//     Jv  1 


j  ^''33^'24'' 


lilAA  '  '  +AA  ^  ^   +AA^^  +AA  '  '   +  AA  ^  ^  ^  AA 


+ A.ry*     3  ill'T'  ^  /I  /v*     3  /)i  A  /)->     /v»  3 

^'12   '13  '/     I     ^'12' 13         /  I     ^'12    '14       f     ft     I     ^'12' 14        '  iir 

+  ^  "3  ^'^  +  AAA  + 

+ ^^13^  ^14       '  I      '^^  13'  14''       '     iv     I     '^'12''  ''23       "     '     I      '^''l2''33^  " 

AM        +  AAA  ^'^     +  AA^     ^'^  +  AA^3 

+  AAA  '         AAA  "3      +  AAAA    ^  +  AAAA  ^  ^ 

I     ^^12  ^24       /     "     ,     ^M2V       //     iv     I       ^^13  ^24        /  ,        ^M3^  24  '  iy 

+  AAA  "  +  /3iAA  '  '  ^  AAAA  ^  ^   +  AAAA  '  ^ 

4^14^  ,  '^'^'l4^'24^       '/     iv     1      l!i3^        //     ///     ,     jV^l       I'  iv 

+  AAA  '  '  ^  AAA  ^  ^  ^  AAA  '  '   +  AAA  ^ 

4.    i!j2!l34       /      '/     ,        4^'l2^'34''^         ///     iv      ,     4^'l3^  ^^34  /     ,     ^Mjl       "/  iv 

+  AAA  '  '  +  /3iAAA  '   '  +  AAA  ^      ^  AAA  ^  ^ 

,     gTu  ^34       /     iv     I     ^M4%  iv   4.  ^lailsi       "     '/      I     ^^23^34         "/  iv 

+  AAA  '  '  ^  AAA  '    '  +  AAA  ^  ^   +  AAA  ^ 

,     "^^^'24"  ^'34       "     iv     1     '^^24^'34''^  iv     1     6?V!i3^       ,     ,/     //,     ,     ^^'12^  ^4^       /     /'  iv 

+  AAA  '      ^  +  ^lAA  +  AAA     '  ' 

fir   2  ^   2  fi,.   3 2  «^   2      Z  2  ^  2 

1    "hs  ^14       '  iv    I    "M2  %        /     /'     ///     I     "MS  ^23        '     /'     I"     I      "^4  ^23         '  / 

+  AAA  "^^""^         AAA  "^^"^^      +  AAA  +  AAAA  ^ ^ 

^  AAA  +  AAAA  ^  '  '   '  +  AAA  '  '  '  ^  AAA 
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"nZ   ^S4  '/  ,        ,  iv    I     "'13   '34        '  iv    I    "'14  '34        '     /"     iv     l     "'23  '34 

,    6^2/7-34^      „  12^3' ^3^14     '  "    ,    ^^JMjlhi  ^  '^/"    I  ^^^^2^3^-14' 

+    ^,/33^,  +     A^2^3^4  ^     AA/33^4  ^     A/32/33^4  ^ 

L  ^13^^23  '/     '//     I     ^^^'l2^'l3^^'23  •'  ,      ,  I     -^^^'l2^'l3^23'^       '     "  /'/ 

+     A^2/33/34     '^'^  +     A/32/33/34  +     ^^/32A/34  "^^"^^ 

12?\3^  ^14^*23     /    '/    ,    l^r^gr^/rgg  .        12n3^'i4^'23^     /  "/ 

+   A/32A/34     '  '  +   A/32/33A  ^  /3iAA/3.  ''^'^ 

1      12i%2" '''i3''24       /     //  1      125%2''i3^ ''24       /  '//     1      12^]^2^'l3^24''       /     /'  iv 

+  Id  o  0  0     "^'a  '^'3  +     o  o  a  Q     '^'Z  '^1  '^1      +     o  000     '^'l  '^'l  '^'l 


/3iMs^i    '  '         AA^3A    '  '    '  ^1/32/33/3. 


I     12ri3"  7-23%       ,     „  1  ^^'12^^23^  ^'34       //     '//     ,  '^^'>"uV^~       //  iv 

+     A/32^3/34  ^  ^  A^2/33A     ^   ^  /3AA^4  ' 

12?j3^r2£24     /    "    ///    ,    12r^3r3/?-2t  l^r^gV-ag^-g./     ,  „ 

+     AA^3A  ^       +     AA2/33/34     '^'^  +     AA/33A  '^'^ 


12^14^^237-24       ,     „  127j£23i^4       //     W     iv     I     12^14';23'i4^       //  i, 


127-13^7-1,7-3,        ,  12ri3ri4V3,        „  12ri37-i,7-3/ 

+    AAA   "^^"^^      +    AAA  AAA 

I     12^*12^  ^'23^34  //.,  //'     I     12^'l2^'23^^'34       II.,  Ill     ■     12^i2^y34^  ^   //  .  ///  ,  iv 


,     12^3^  ^y34       /     iv     1     12>^3^'23'^  ''34       //     ///     ,     12^'l3^'23^'^^       ///  ,v 

+  AAAA  ^ +  AAAA  ^      ^  /3,AAA 


_1_   ^:^4ll2£3t       /      ///     iv  _l_    12''ur23"  ^'34       //  iv     l      ^^MR   „  iv 

+  ""^'^        '    AAAA  ^       '  +  AAAA  ^ 


127'io"  7'q4.7'q,        ,     ,,      .,     .     12'/'io7-o-2  7>  12/'io7 


,     —'12   '24' 34       /  //     iv     ,     ^^'12' 24" '  34       "     iv     I     -^^' 12'V^'34'       //     '//  iv 

+  /3iAAA  ^'i        +  /3iAA.A   '  '  +  AAAA  ^  '  ' 

127-13^  7-247-3,  „  i2/;i37'242  7-,,       ,  12/^.,r247-3/  „ 

+  AAAA  '  +  AAAA  ^  ^    '  ^  mm   '  ' 
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12r-,J'n,')\,     .   ....   .    \  2r-,/r„/ „   ...   .  \%\/i\,r^? 


^   ^,AA3/3.  ^  '1    3    -t-  ^^^^^^^^  ti 


+       A^3^,  +       ^^^^^^       ^.^1^2       +  ^^^^^^ 

+    ^,^,^3/34    "^^'^"^^     ^    A^./33/3.  ^    A  A/33/3.  ''^'^''^ 

+   A  A/33/34  +  A  AAA  '^^'^^''^      ^  AAA/37  "^^""^ 

,     24ri2?VWi4       '     //     "/     ,     247j£i£2£3.^  ,      „  24?'^37'i//'o3r3^       ,  „ 

+  AAAA  ^  '  '   ^  AAAA  ^  '  '    '  +   ^lAAA    '  '  ' 
+   AAAA  '  +  AAAA  ^  ^      +  AAAA    '  '  ' 

^  ^,AAA       '      +  AAAA  +  ^.AAA  J 

.  (Ixxxiii,). 

In  the  case  of  three  variables,  we  must  cancel  in  the  above  expression  all  terms 
involving  A..  Thus  we  shall  have  3  instead  of  6  first  order  terms,  6  instead  of  21 
second  order  terms,  10  instead  of  56  third  order  terms,  and  15  instead  of  126  fourth 
order  terms — a  much  more  manageable  series. 

I  give  below  the  extra  term  necessary  for  calculating  the  value  of  (Q  —  Qo)/Qo  as 
far  as  the  fifth  order  terms  in  the  case  of  three  variables. 

Fifth,  Order  Terms  for  Three  Variables. 

1    f  '^'a'^^        //     ///     1       '31^        ///     /     I       'li^        I     "     I     ^^31*'l3       ///  / 

eI^,  «  +  Aft         +  ftA    "*  +  ftftft 

I      ^',31*^23       /     ///     I      ^'12*  ^'23       '      //     I      ^^12*^  31       /  " 

+  isM  "3""  +  ftft^"^<  +  ftft^, 

K).    4,,  4,,.  in,.    3„.  2 

1     '-''23    '31       "     "/     i      -^'23   '12        /'     ///     I      "-"'SI    '12         '     //  f" 

+  +ftftft"*"»  +  ftAft""^"'"^ 

in..  3,,  2  in,.  3,,.  2  in..  3..  2 

1      -^"31      "3         '     /       ///     1  12     "3         '      "     ///     I  12     .31         '     "  /// 

10/-        "  ^0r         '  20r  r 

I     -"-^'33   '31         /     "     ///     1     -"^"'23   '12         '     "  I     ■^*^'23   '3l'l2  ' 

+  ft/JA       +  -ft^A  "'"^'^  +"ftft^; 

•      +    AAA   ''''   +    AAA  ^    AAA  "^^"^^^ 

^    AAA     '  '  '  ^    AAA    ""^^^^M U™v.;. 
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A  numerical  illustration  of  these  formulse  will  be  given  in  the  latter  part  of  this 
Memoir.    It  will,  however,  be  clear  that  what  we  want  are  tables  of  log  including 

log  or  log  (^—^  for  a  series  of  values  of  h.  Such  tables  would  render  the  compu- 
tation of  ^         fairly  direct  and  rapid ;  they  could  be  fairly  easily  calculated  from 

existing  tables  for  the  ordinate  and  area  of  the  normal  curve,  and  I  hope  later  to 
find  some  one  willing  to  undertake  them. 

Meanwhile  let  us  look  at  special  cases.    In  the  first  place,  suppose,  in  the  case  of 
three  variables,  that  the  division  of  the  groups  is  taken  at  the  mean,  i.e.,      =  h.-,  = 
7^3  =  0.    Then  we  have 


A  =      =  ^3  =  fe-^^'  dx  =        I . 


'  0 

=  v^'  =  v{"  =  0 

<  =  <  =  -1 

<  =  <'  =  v'"  =  0 


^3    —   "^3     —  "^3 

Hence  we  have 


^      fo  io  L  ^ c^a^3  =  Qo  j  1  +  |r  (^13  +  ^13  +  ^23)} 

+  ^  {~{rn'  +  V  +  V)}  +  ll  [vHr,,'  +  ^^31^  +  •  •  • 

=  Qo  j  1  +  ~  (sin-i      +  sin'^  ^-^g  +  sin-^  r.3)  |  (Ixxxv.). 

Let  =  cos  D^^,  r^g  =  cos  D^g,  rgg  =  cos  D,,,,  and  let  E  be  the  spherical  excess  of 
the  spherical  triangle  whose  angles  are  the  divergences  D^g,  D^g,  D23.  Then 
we  have 

sin^^f  =  cosE     .......  (Ixxxvi.). 

Now  take  the  case  of  four  variables.    Here  we  have 

^1  =     =     =     =  a/' 


IT 

2 

III 


=      ■  r=  =         =  1 


and  all  the  odd  v's  zero.  Hence 

VOL,  cxcv. — A.  F 
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Q  -  Qo       2  ,  ^   .  /2\2, 

Q 


=      ('-12  +  r,,  +  rj,  +  r,,  +  r,,  +  r,,)  +         {r,,r,,  +  r,,r,,  +  r,,r,,) 


/V»  'V* 

'  12' 23' 24 


/  2  \3  1 

+  ^'is'-SS^Si  +  ^Yir24r34)  +  (  -  )    ^(^'mSs  +  ''14^23'  +  ^13^2./  +  ^^13'^24  +  ^^l/^'34 


/  2  \2  ]_ 

+  ^'12^3./)  +  [~)  i2^{rn'rur,,  +  r,,\^r,,  +  r,^\,r,.,  +  ^i3ri4'^"24  +  '^3^23'''24 

+  ^14''23^24'  +  V.^^i  +  *-l?n^'34  +  ^^12^23''-3i  +  ^•m.^'23^'34'  +  ^13^2/^34. 

+  ^13'W^34^)  +  (IxXXvii.). 

This  is  the  correct  value  including  terms  of  the  fourth  order,  but  to  this  order  of 
approximation  we  can  throw  it  into  a  much  simpler  form.    Let      =  sin  S,,,,  then 


^-Q-^  ^  =  sin  1 +  sin  ^  r^g  +  sin  ^  r^^  -f  sin  ^  r.^  +  sin"!      +  sin"!  r^^ 
2  . 

+  ^  (sin~^      sin~^      sin~^  r^^  +  sin~^      sin~^  r.^^  sin~^ 

+  sin-i  rjg  sin-i  ^23  sin"^  r^.^.  +  sin"^  r^^  sin"!  7^24  sin"!  7-3^) 

+  I  [sin-i  r,,  sin-i  r23{(l  -  r^/)  (1  -  r,3^)  (1  -  r,/)  (1  -  rs/)]-^ 
+  sin-^  ;v3sin-i  _  ^^^2)     _  ,,^^2)     _  ,,^^2)     _  ,,^^2)|-i 

+  sin-^-,3  sin-i  -  n/)  (1  -  V)  (1  -  n.3^)  (1  -  V)}-*] 

■  §13  +  S]3  +  §14  +  §23  +  §24  +  §34 
2 

+  ^  (§12§13§14  +  §13§23§24  +  §13§33§34  +  §14§24§34) 

2   /S14S23  cos  823  +  cos        cos         -f-      382.4  COS  8,3  COS  §24 


\  cos       COS       cos        COS  823  cos  834  COS  834 

(Ixxxviii.), 

We  may  write  this  sin  —  =  cosE'  (Ixxxix.) 

Qo      2  ^  ' 

where 

=  Y  ~  §13  "~  §13  —  §14  —  §23  —  §24  "  §34 

2 


(§12§]3§14  +  §12§23§24  +  §13§23§24  +  §14§24§34) 


2  /'8^4823  COS  8)^^  cos  803  +  8]283.),cos  8j2Cos  834  +  8^3824  cos  8^3  cos  834 

TT  \  cos  8]  2  COS  8^3  COS  814  COS  S23  cos  834  cos  834 

The  expressions  E  and  E'  of  (Ixxxvi.)  and  (Ixxxix.)  are  of  considerable  interest,  for 
they  enable  us  to  express  the  area  of  a  spherical  triangle  in  three-dimensioned  space, 
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and  (up  to  the  above  degree  of  approximation)  the  volume  of  a  "  tetrahedron  "  on  a 
"  sphere  "  in  hyperspace  of  four  dimensions.  In  fact,  the  whole  theory  of  hyperspace 
"  spherical  trigonometry  "  needs  investigation  in  relation  to  the  properties  of  multiple 
correlation. 

In  our  illustrations  (viii.)  and  (ix.)  will  be  found  examples  of  the  above  formulas 
applied  to  important  cases  in  triple  and  quadruple  correlation  in  the  theory 
of  heredity.  I  consider  that  the  formulae  above  given  will  cover  numerous  novel 
apphcations,  for  many  of  which  greater  simplicity  will  be  introduced  owing  to  the 
choice  of  special  values  for  the  h's  or  for  the  correlation  coefficients. 


(8.)  Illustrations  of  the  New  Methods. 


Illustration  I.  Liheritance  of  CoaUcolour  in  Horses. — The  following  represents 
the  distribution  of  sires  and  fillies  in  1050  cases  of  thoroughbred  racehorses,  the 
grouping  being  made  into  all  coat-colour  classed  as  "  bay  and  darker,"  "  chesnut  and 
lighter  "  :— 


Sires. 

Colour. 

Bay  and 
darker. 

Chesnut  and 
lighter. 

m 
93 

Bay  and  darker  .    .  . 

631 

125 

756 

Chesnut  and  lighter 

147 

147 

294 

778 

272 

1050 

a 

b 

\a  +  h 

c 

d 

c  +  d 

a  +  c 

h  +  cl 

N 

Then  we  require  the  correlation  between  sire  and  filly  in  the  matter  of  coat-colour, 
and  also  the  probable  erroi-  of  its  determination. 
We  have  from  (iv.)  and  (v.) 


{a  +  c)  -  (h  -I-  d) 
(«  -f  5)  —  (c  d) 


=  \/~  jV^^'c^^c  =  -481 
=  Ay^\y^'''dy  =  -440, 


^2  -  N 

Hence  from  the  probability  integral  tables 

h  =  -64630,  k  =  -58284. 

We  have  then  :  log  HK  =  1  -037,3514  by  (xvii.), 

F  2 


,905, 
000. 
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Thence  e  =  ^^^'^^^^  =  '619,068  from  (xxi.). 

N~HK  ^  ' 

Calculating  out  the  coefficients  of  the  series  in  r  in  (xix.)  we  find 

•619,068  =  r  +  -188,3459^3  +  •064,0814r«  +  -107,8220^^  +  •005,9986r5  +  •067,2682r6 

+  &c. 

Neglecting  powers  of  r  above  the  second,  we  find  by  solving  the  quadratic  and 
taking  the  positive  root 

r  =  -5600. 

Solving  by  two  approximations  the  sextic  we  finally  determine 

r  =  -5422, 

correct,  I  think,  to  four  places  of  figures. 

Turning  now  to  the  probable  error  as  given  by  Equation  (1.),  I  find 

7^3  4_  p  _  2rhk  =  "348,924, 

and  from  (xlix.) 

logxo  =  1"170,0947. 
Further  :  =  '275,642  ,        4^=^  =  -393,078. 

Hence  from  (xlvii.)  and  (xlviii.)  we  find 

1       r-393,078  1  r275,642 

and  by  means  of  the  probability  integral  table 

=  -108,884,        i//.  =  -152,865. 
By  substituting  in  (1.),  we  find 

probable  error  of  r  —  -0288. 
From  (xxxiv.)  and  (xxxv.)  we  find 

p.e.  of  h  —  -0282.  p.e.  of  ^  =  -0278. 

Thus,  finally,  we  may  sum  up  our  results 

U  =  -6463  ±  -0282,  k  =  '5828  ±  *0278, 

r  =  -5422  ±  -0288. 

The  probable  error  of  this  r,  if  we  had  been  able  to  find  it  from  the  product 
moment,  would  have  been  -0147,  or  only  about  one-half  its  present  value. 
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Illustration  II. — Our  analysis  opens  a  large  field  suggested  by  the  following 
problem: — What  is  the  chance  that  an  exceptional  man  is  horn  of  an  exceptional 
father  ? 

Of  course  much  depends  on  how  we  define  "  exceptional,"  and  any  numerical 
measure  of  it  must  be  quite  arbitrary.  As  an  illustration,  let  us  take  a  man  who 
possesses  a  character  only  possessed  by  one  man  in  twenty  as  exceptional.  For 
example,  only  one  man  in  twenty  is  more  than  6  feet  1  "2  inches  in  height,  and  such  a 
stature  may  be  considered  "  exceptional."  In  a  class  of  twenty  students  we  generally 
find  one  of  "  exceptional "  ability,  and  so  on.  Accordingly  we  have  classed  fathers  and 
sons  who  possess  characters  only  possessed  by  one  man  in  twenty  as  exceptional.  We 
first  determine  h  and  ^,  so  that  the  tail  of  the  frequency  curve  cut  ofi"  is  of  its 
whole  area.    This  gives  us,  h  =■  k  =  1  "64485. 

Next  we  determine  HK  =  ^e-*''''+'''\  and  find  log  HK  =  2-026,8228. 

Then  we  calculate  the  coefficients  of  the  various  powers  of  r  in  (xix.).    We  find 

log  IM  = -131,2225. 

logi(/i2-  1)  =  1-685,5683. 

log^(^2  _  3)(^2  -  3)  =  3-990,1176. 

log  rio      -  6/^^  +  3)  {¥  —  6F  +  3)  =  1-464,4772. 

Jik 

^og^^{h^  —  lOh^  +  15)  (^*-  10P+  15)  =  2-925,6367. 

It  remains  to  determine  what  value  we  shall  give  to  r,  the  paternal  correlation.  It 
ranges  from  -3  to  -5  for  my  own  measurements  as  we  turn  from  blended  to  exclusive 
inheritance.    Taking  these  two  extreme  values  we  find 

(zd  "he 

-^r—  =  -0046344     or  -0096779. 

-J3      ad  — he        d        (d  +  h){d  +  c)  •      1       i  /.  •  1 

-t)ut  — ~  "jf  ^  '         ^"-^  second  term  is  the  chance  of  exceptional 

fathers  with  exceptional  sons,  when  variation  is  independent,  i.e.,  when  there  is  no 
heredity,  =       X  -ro  =  "0025. 

Thus  c^/N  =  -007134    or    -012178 ; 

accordingly  &/N  =  -042866    or  -037822. 

Hence  we  cionclude  that  of  the  5  per  cent,  of  exceptional  men  "71  per  cent,  in  the 
first  case,  and  1-22  per  cent,  in  the  second  case,  are  born  of  exceptional  fathers,  and 
4-29  per  cent,  in  the  first  case  and  3-78  per  cent,  in  the  second  case  of  non-exceptional 
fathers.  In  other  words,  out  of  1000  men  of  mark  we  may  expect  142  in  the  first  case, 
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244  in  the  second,  to  be  born  of  exceptional  parents,  while  858  in  the  first  and  756  in 
the  second  are  born  of  undistinguished  fathers.  In  the  former  case  the  odds  are  about 
6  to  1,  in  the  latter  3  to  1  against  a  distinguished  son  having  a  distinguished  father. 
This  result  confirms  what  I  have  elsewhere  stated,  that  we  trust  to  the  great  mass  of 
our  population  for  the  bulk  of  our  distinguished  men.  On  the  other  hand  it  does  not 
invalidate  what  I  have  written  on  the  importance  of  creating  good  stock,  for  a  good 
stock  means  a  bias  largely  above  that  due  to  an  exceptional  father  alone. 

In  addition  to  this  the  yo  of  the  population  forming  the  exceptional  fathers  pro- 
duce 142  or  244  exceptional  sons  to  compare  with  the  858  or  756  exceptional  sons 
produced  by  the  1§  of  the  population  who  are  non-exceptional.  That  is  to  say  that 
the  relative  production  is  as  142  to  45*2,  or  244  to  39"8,  i.e.,  in  the  one  case  as  more 
than  3  to  1,  in  the  other  case  as  more  than  6  to  1,  Li  other  words,  exceptional 
fathers  produce  exceptional  sons  at  a  rate  3  6  times  as  great  as  non-exceptional 
fathers.  It  is  only  because  exceptional  fathers  are  themselves  so  rare  that  we  must 
trust  for  the  bulk  of  our  distinguished  men  to  the  non-exceptional  class. 

Illustration  III.  Ilerediti]  in  Coat-colour  of  Hounds. — To  find  the  correlation 
in  coat-colour  between  Basset  hounds  which  are  half-brethren,  say,  offspring  of  the 
same  dam. 

Here  the  classification  is  simply  into  lemon  and  white  (Jiw)  and  lemon,  black  and 
white  or  tricolour  [t), 

The  following  is  the  table  for  4172  cases  :  — 


Colour. 

t. 

Iw. 

Totals. 

t. 

1766 

842 

2608 

Iw. 

842 

722 

1564 

Totals 

2608 

1564 

4172 

Proceeding  precisely  in  the  same  way  as  in  the  first  illustration  we  find  : 

=  ttg  =  -25024 
h  =  k  =  -318,957 
log  KH  =  1-157,6378 
e=  -226,234. 

It  will  be  sufiicient  now  to  go  to  r*.    We  have 

-226,234  =  r-\-  -050,867  r^      -134,480  r^  -f  -035,587  7-\ 
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The  quadratic  gives  r  =  -2237.  Using  the  Newtonian  method  of  approximating  to 
the  root  we  find 

r  =  -2222. 

Summing  up  as  before,  after  finding  the  probable  errors,  we  have 

h  =  k=  -3190  ±  -0133, 
r  = -2222  ±  •0162. 

Illustration  IV.  Inheritance  of  Eye-colour  in  Man.--To  find  the  correlation  in 
eye-colour  between  a  maternal  grandmother  and  her  granddaughter.  Here  the 
classification  is  into  eyes  described  as  grey  or  lighter,  and  eyes  described  as  dark  grey 
or  darker.* 


Tint. 

Maternal  grandmother. 

Totals. 

Grey  or  lighter. 

Dark  grey  or 
darker. 

Granddaughter. 

Grey  or  lighter .... 

254 

136 

390 

Dark  grey  or  darker  .  . 

156 

193 

349 

Totals  

410 

329 

739 

As  before,  we  find 

aj  =  -109,607,  =  '055,480, 

h  =  -138,105,  Jc  =  -069,593, 

log  HK  =  T-196,6267, 
e  =  -323,760. 

Series  for  r  up  to 

-323,760  =  r  +  •004,806r3  +  -162-696r3  +  -000,358r^ 
The  quadratic  gives  r  =  -3233,  and  the  biquadratic 

r  =  -3180, 

the  value  of  the  term  in     being  "000,00366,  so  that  higher  terms  may  be  neglected. 
Determining  the  probable  errors  as  in  Illustration  I.,  we  sum  up  : — 

*  According  to  Mr.  Galton's  classification,  the  first  group  contains  eyes  described  as  light  blue,  blue, 
dark  blue,  blue-green,  grey ;  and  the  second  eyes  described  as  dark  grey,  hazel,  light  brown,  l^rown,  dark 
brown,  verv  dark  brown,  black. 
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h  =  -1381  ±  -0312, 
k  =  -0696  db  '0311, 
r  =  -3180  ±  -0361. 

Illustration  V.  Inheritance  of  Stature. — The  following  data  have  been  found  for 
the  inheritance  of  stature  between  father  and  son  from  my  Family  Data  cards,  1078 
cases  : — 

Mean  stature  of  father  .    .    .    .  67"-698 

son  68"-661 

Standard  deviation  of  father  .    .  2"*7048 

son.    .    .  2"-7321 

Correlation  =  -5198  db  '0150. 

Now  for  purposes  of  comparison  of  methods  the  correlation  has  been  determined 
for  this  material  from  various  groupings  of  fathers  and  sons  : — 


(A.)  Fathers. 


Class. 

Below  67"-5. 

Above  67"-5. 

Totals. 

Below  67"-5   .  . 

269-25 

95-75 

365 

Above  67"-5  .  . 

232-25 

480-75 

713 

Totals    .    .  . 

501-5 

576-5 

1078 

(B.)  Fathers. 


Class. 

Below  66"-5. 

Above  66"-5. 

Totals. 

Below  67" -5   .  . 

211-25 

153-75 

365 

Above  67"-5  .  . 

152-75 

560-25 

713 

Totals    .    .  . 

364 

714 

1078 
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Fathers. 


Class. 

Below  67"-5. 

Above  67"-5. 

Totals. 

Below  68-5"   .  . 

356*25 

182-25 

538-5 

Above  68-5"   .  . 

145-25 

394-25 

539-5 

Totals    .    .  . 

501-5 

576-5 

1078 

(D.)  Fathers. 


Class. 

Below  68"-5. 

Above  68"-5. 

Totals. 

Below  6  9" -5    .  . 

506 

182 

688 

Above  69"-5   .  . 

149-5 

240-5 

390 

Totals 

655-5 

422-5 

1078 

(E.)  Fathers. 


Class.  Below  69"-5.  Above  69"-5.  Totals. 


Below  70"-5   .    .  669  147  816 


Above  70"-5   .    .  128  134  262 


Totals    ...  797  281  1078 


(F.)  Fathers. 


Class. 

Below  70" -5. 

Above  70" -5. 

Totals. 

Below  69"-5    .  . 

641-25 

46-75 

688 

AboN^e  6  9" -5   .  . 

271-75 

118-25 

390 

Totals    .    .  . 

913 

165 

1078 
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Table  of  Results. 


Classificatiom. 

Correlation. 

Mean  of  sons. 

Mean  of  fathers. 

A 
B 
C 
D 
E 
F 

•5939  ±  -0247 
•5557  +  -0261 
•5529  +  -0247 
•5264  +  ^0264 
•5213  ±  -0294 
•5g24  +  -0307 

k. 

68"^64  (-•416,32) 
68"^64  (  -^416,32) 
68"^50  (-•001,16) 
68"-53  (-353,71) 
68"-60  (-696,57) 
68"-53  (-353,71) 

h. 

67"-74  ( -  -087,00) 
67"-63  (  --418,86) 
67"-74  (--087,30) 
67"-77  (-274,30) 
67"-76  (-641,30) 
67"-73  (1-023,44) 

Now  these  results  are  of  quite  peculiar  interest.    They  show  us  : — 

(i.)  That  the  probable  error  of  r,  as  found  by  the  present  method,  increases  with 
h  and  h  But  the  increase  is  not  very  rapid,  so  that  the  probable  errors  of  the  series 
range  only  between  "025  and  '031.  Hence  while  it  is  an  advantage,  it  is  not  a  very 
great  advantage,  to  take  the  divisions  of  the  groups  near  the  medians.  It  is  an 
advantage  which  may  be  easily  counterbalanced  by  some  practical  gain  in  the  method 
of  observation  when  the  division  is  not  close  to  the  medians. 

(ii.)  While  the  probable  error,  as  found  from  the  present  method  of  calculation,  is 
1'5  to  2  times  the  probable  error  as  found  from  the  product  moment,  it  is  by  no 
means  so  large  as  to  seriously  weigh  against  the  new  process,  if  the  old  is  un- 
available. It  is  quite  true  that  the  results  given  by  the  present  process  for  six 
arbitrary  divisions  differ  very  considerably  among  themselves.  But  a  consideration 
of  the  probable  errors  shows  that  the  differences  are  sensibly  larger  than  the  prob- 
able error  of  the  differences,  even  in  some  case  double  ;  hence  it  is  not  the  method 
but  the  assumption  of  normal  correlation  for  such  distributions  which  is  at  fault.  As 
we  shall  hardly  get  a  better  variable  than  stature  to  hypothesise  normality  for,  we 
see  the  weakness  of  the  position  which  assumes  without  qualification  the  generality 
of  the  Gaussian  law  of  frequency. 

(iii.)  We  cannot  assert  that  the  smaller  the  probable  error  the  more  nearly  will 
the  correlation,  as  given  by  the  present  process,  agree  with  its  value  as  found  by 
the  product  moment.  If  we  did  we  should  discard  "5213,  a  very  accordant  result, 
in  favour  of  "5529,  or  even  "5939.  The  fact  is  that  the  higher  the  correlation  the 
lower,  ceteris  'paribus,  the  probable  error,  and  this  fact  may  obscure  the  really  best 
result.  Judging  by  the  smallness  of  h  and  k  and  of  the  probable  error,  we  should 
be  inclined  to  select  C  or  the  value  "5529.  This  only  differs  from  '5198  by  slightly 
more  than  the  probable  error  of  the  difference  ('033  as  compared  with  '029)  ;  but 
since  both  are  found  from  the  same  statistics,  and  not  from  different  samplings  ot 
the  same  population,  this  forms  sufficient  evidence  in  itself  of  want  of  normality. 
The  approximate  character  of  all  results  based  on  the  theory  of  normal  frequency 
must  be  carefully  borne  in  mind ;  and  all  we  ought  to  conclude  from  the  present 
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data  lor  inheritance  of  stature  from  father  to  son  would  be  that  the  correlation 
=  *55  ±  '015,  while  the  product  moment  method  would  tell  us  more  definitely  that 
its  value  was  '52  ±  '015.  There  is  no  question  that  the  latter  method  is  the  better, 
but  this  does  not  hinder  the  new  method  from  being  extremely  serviceable ;  for 
many  cases  it  is  the  only  one  available. 

Illustration  VI.  Effectiveness  of  Vaccination. — To  find  the  correlation  between 
strength  to  resist  small-pox  and  the  degree  of  effective  vaccination. 

We  have  in  the  earlier  illustrations  chosen  cases  in  which  in  all  probability  a  scale 
of  character  might  possibly,  if  with  difiiculty,  be  determined.  In  the  present  case, 
the  relationship  is  a  very  important  one,  but  a  quantitative  scale  is  hardly  discover- 
able. Nevertheless,  it  is  of  great  interest  to  consider  what  results  flow  from  the 
application  of  our  method.  We  may  consider  our  two  characters  as  strength  to  resist 
the  ravages  of  small-pox  and  as  degree  of  effective  vaccination.  No  quantitative 
scales  are  here  available  ;  all  the  statistics  provide  are  the  number  of  recoveries 
and  deaths  from  small-pox,  and  the  absence  or  presence  of  a  definite  vaccination 
cicatrix.  Taking  the  Metropolitan  Asylums  Board  statistics  for  the  epidemic  of  1893, 
we  have  the  table  given  below,  where  the  cases  of  "  no  evidence"  have  been  omitted. 
Proceeding  in  the  usual  manner  we  find 

=    -86929  a2  =  -54157 

h  =  1-51139  k  =  -74145 

e  =  -782454. 

Hence  the  equation  for  r  is 

•782,454  =  r  +  •560,310r3  -  •096,378r3  +  -081,881     -  •000,172r5  -  -040,059^6 

whence  r  =  -5954, 

Summing  up  we  have,  after  calculating  the  probable  errors, 

h  =  1-5114  ±  -0287, 
k  =  -7414  ±  -0205, 
r=    -5954  -0272. 

Strength  to  resist  Small-pox  when  incurred. 

Total. 


1604 


477 


2081 


Cicatrix. 

Recoveries. 

Deaths. 

Present 

1562 

42 

Absent 

383 

94 

Total 

1945 

136 

G  2 
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We  see  accordingly  that  there  is  quite  a  large  correlation  between  recovery  and  the 
presence  of  the  cicatrix.  The  two  things  are  about  as  closely  related  as  a  child  to  its 
"  mid-parent."  While  the  correlation  is  very  substantial  and  indicates  the  protective 
character  of  vaccination,  even  after  small-pox  is  incurred,  it  is,  perhaps,  smaller  than 
some  over- ardent  supporters  of  vaccination  would  have  led  us  to  believe. 

Illustration  VII.  Effectiveness  of  Antitoxiyi  Treatment. — To  measure  quanti- 
tatively the  effect  of  antitoxin  in  di23htheria  cases. 

In  like  manner  we  may  find  the  correlation  between  recovery  and  the  administration 
of  antitoxin  in  diphtheria  cases.  The  statistics  here  are,  however,  somewhat  difficult 
to  obtain  in  a  form  suited  to  our  purjDose.  The  treatment  by  antitoxin  began  in  the 
Metro2:)olitan  Asylums  Board  hospitals  in  1895,  but  the  serum  was  then  administered 
only  in  those  cases  which  gave  rise  to  anxiety.  Hence  we  cannot  correlate  recovery 
and  death  with  the  cases  treated  or  not  treated  in  that  year,  for  those  who  were  likely 
to  recover  were  not  dosed.  In  the  year  1896  the  majority  of  the  cases  were,  on  the 
contrary,  treated  with  antitoxin,  and  those  not  treated  were  the  slight  cases  of  very 
small  risk  ;  hence,  again,  we  are  in  great  difficulties  in  drawing  up  a  table.*  Further, 
if  we  compare  an  antitoxin  year  with  a  non-antitoxin  year,  we  ought  to  compare  the 
cases  treated  with  antitoxin  in  the  former  year  with  those  which  Avould  probably  have 
been  treated  with  it  in  the  latter  year.  Lastly,  the  dosage,  nature  of  cases  treated, 
and  time  of  treatment  have  been  modified  by  the  experience  gained,  so  that  it  seems 
impossible  to  club  a  number  of  years  together,  and  so  obtain  a  satisfactorily  wide 
range  of  statistics.  In  1897,  practically  all  the  laryngeal  cases  were  treated  with 
antitoxin.  Hence  the  best  we  can  do  is  to  compare  the  laryngeal  cases  in  two  years, 
one  before  and  one  after  the  introduction  of  antitoxin.  The  numbers  available  are 
thus  rather  few,  but  will  help  us  to  form  some  idea  of  the  correlation.  I  take  the 
following  data  from  p.  8  of  the  Metropolitan  Asylums  Board  '  Report  upon  the  Use  of 
Antitoxic  Serum  for  1896  ' : — 


Laryngeal  cases. 

Recoveries. 

Deaths. 

Totals. 

With  antitoxin,  1896  .... 

319 

143 

462 

Without  antitoxin,  1894  .    .  . 

177 

289 

466 

496 

432 

928 

*  When  a  new  drug  or  process  is  introduced  the  medical  profession  are  naturally  anxious  to  give  every 
patient  the  possible  benefit  of  it,  and  patients  of  course  rush  to  those  who  first  adopt  it.  But  if  the  real 
efficiency  of  the  process  or  drug  is  to  be  measured  this  is  very  undesirable.  No  definite  data  by  which  to 
measure  the  eflfcctiveness  of  the  novelty  are  thus  available. 
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Here  I  find  r  =  '4708  ±  -0292. 
A  further  table  is  of  interest  : — 


Laryngeal  cases. 

Kequinng 
tracheotomy. 

Not 
requiring  it. 

Without  antitoxin,  1894  .    .  . 

261 

205 

466 

With  antitoxin,  1896   .    .    .  . 

188 

274 

462 

Totals  

449 

479 

928 

In  this  case  we  have  r  —  -2385  ±  "OSSS. 
Lastly,  I  have  drawn  up  a  third  table  : — 


Total  Infantile  Cases,  Ages  0 — 5  years. 


Recovery. 

Death. 

Totals. 

With  antitoxin,  1896    .    .  . 

912 

434 

1346 

Without  antitoxin,  1894   .  . 

615 

556 

1171 

Totals  

1527 

990 

2517 

Here  we  have*  t  —  -2451  ±  -0205. 

The  three  coefficients  are  all  sensible  as  compared  with  their  probable  errors,  and 
that  between  the  administration  of  antitoxin  and  recovery  in  laryngeal  cases  is 
substantial.  But  the  relationship  is  by  no  means  so  great  as  in  the  case  of  vaccina- 
tion, and  if  its  magnitude  justifies  the  use  of  antitoxin,  even  when  balanced  against 
other  ills  which  may  follow  in  its  train,  it  does  not  justify  the  sweeping  statements  of 
its  effectiveness  which  I  have  heard  made  by  medical  friends.  It  seems  until  wider 
statistics  are  forthcoming  a  case  for  cautiously  feeling  the  way  forward  rather  than  for 
hasty  generalisations. 

Illustration  VII L  Effect  on  Produce  of  Superior  Stock. — To  find  the  effect  of 
superiority  of  stock  on  percentage  goodness  of  produce. 

To  illustrate  this  and  also  the  formula  (Ixxxiii,)  for  six  correlation  coefficients,  we  will 
investigate  the  effect  of  selecting  sire,  dam,  and  one  grandsire  on  the  produce  when  there 

*  The  values  of  r  for  all  the  three  cases  of  this  Illustration  were  determined  with  great  ease  from 
Equation  (xxiv,). 
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is  selective  pairing  of  dam  and  sire.  We  will  suppose  grandsire,  dam,  and  sire  to  be 
above  the  average,  and  investigate  what  proportion  of  the  produce  will  be  above  the 
average.  As  numbers  very  like  those  actually  occurring  in  the  case  of  dogs,  horses, 
and  even  men,  we  may  take 

Correlation  of  grandsire  and  offspring   .  =  "25 

,,  sire  or  dam  and  offspring  =  '5  in  both  cases 

,,  sire  and  grandsire  .    .    .  =  '5 

Selective  mating  for  sire  and  dam.    .    .  =  "2 

We  will  suppose  zero  correlation  between  paternal  grandsire  and  dam,  although 
with  selective  mating  this  may  actually  exist.  We  have  then  the  following 
system  : — 

r^^  =■  '25,    r^^  =  '5,        =  '5,    rgg  =  '2,    r-^^  —  *5,    ''"is  =  0. 

Hence,  substituting  these  values  in  (Ixxxvii.),  we  find — after  some  arithmetic  : 

(Q  -  Qo)/Qo=  1-4851. 

But  Qq  is  the  chance  of  produce  above  the  average  if  there  were  no  heredity 
between  grandsire,  sire,  and  dam,  and  no  assortative  mating, 

N 

Hence  it  equals  2XiXi-XfN  =  —     .-.  Q  =  -1553  N. 

Or,  of  the  produce  '5  N  above  the  average,  "ISSS  N  instead  of  '0625  N  are  born  of 
the  superior  stock  owing  to  inheritance,  &c.  In  other  words,  out  of  the  '5  N  above 
the  average,  "1553  N  are  produced  by  the  stock  in  sire,  dam,  and  grandsire  above  the 
average,  or  by  "1827  of  the  total  stock.f  The  remaining  "8173  only  produce  "3447  N, 
or  the  superior  stock  produces  produce  above  the  average  at  over  twice  the  rate  of  the 
inferior  stock.  Absolutely,  the  inferior  stock  being  seven  times  as  numerous  produces 
about  seven-tenths  of  the  superior  offspring. 

Illustration  IX.  Effect  of  Excejjtional  Parentage. — Chance  of  an  exceptional 
man  being  born  of  exceptional  parents. 

Let  us  enlarge  the  example  in  Illustration  II. ,  and  seek  the  proportion  of  exceptional 
men,  defined  as  one  in  twenty,  born  of  exceptional  parents  in  a  community  with 
assortative  mating. 

*  A  correlation,  if  there  be  substantial  selective  mating,  may  exist  between  a  man  and  his  mother-in- 
law.  Its  rumoured  absence,  if  established  scientifically,  would  not,  however,  prove  the  non-existence  of 
selective  mating,  for  A  may  be  correlated  with  B  and  C,  but  these  not  correlated  with  each  other. 

t  The  proportion  of  pairs  of  parents  associated  with  a  grandsire  above  the  average  was  found  by 
putting  -5,  -2,  and  0  for  the  three  correlation  coefficients  in  (Ixxxv.).  In  comparing  with  Illustration  II., 
the  reader  must  remember  we  there  dealt  with  an  exceptional  father,  1  in  20,  here  only  with  relatives 
above  the  average— a  very  less  stringent  selection. 
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Here  we  take  for  father  and  son  =  '5,  for  mother  and  son  =  -5,  and  for 
assortative  mating,      =  "2. 

We  have  then  to  apply  the  general  formulae  (Ixxxiii.)  and  (Ixxxiv.)  for  the  case  of 
three  variables.    We  have 


=  h 

=  1-64485 

=  -484,795 

=  < 

Ill 

=  ^1 

=  1-644,850 

=  <' 

III 

—  ^2 

=  1-705,532 

// 

=  —  -484,356 

< 

=  < 

III 

=  -  5-913,290 

Whence,  after  some  arithmetical  reduction,  we  find 

(Q  -  QoVQo  =  20-0389. 

But  Qo  =  -io  X      X  -4-  N  =  w^o-o  N.    Hence  Q  =  -00263  N. 

We  must  now  distinguish  between  the  absolute  and  relative  production  of  excep- 
tional men  by  exceptional  and  non-exceptional  parents.  The  exceptional  jDairs  of 
parents  are  obtained  by  (xix,),  whence  we  deduce,  putting  r  =  -2,  h  =  Jc  =  1-64485, 

~  N  W  ~  N  ~  400  ~  ^^^^4^- 

Whence  the  number  of  pairs  of  parents,  both  exceptional 

=  -005245  N. 

Thus,  -005245  N  pairs  of  exceptional  parents  produce  -00263  N  exceptional  sons, 
and  -994755  N  pairs  of  parents,  non-exceptional  in  character,  produce  -04737  N 
exceptional  sons,  i.e. ,  the  remainder  of  the  N,  The  rates  of  production  are  thus  as 
'5014  to  -0476.  Or  :  Pairs  of  exceptional  parents  produce  exceptional  sons  at  a  rate 
more  than  ten  times  as  great  as  p)airs  of  non- exceptional  parents.  At  the  same  time, 
eighteen  times  as  many  exceptional  sons  are  born  to  non-exceptional  as  to  exceptional 
parents,  for  the  latter  form  only  about  |-  per  cent,  of  the  community. 

The  reader  who  will  carefully  investigate  Illustrations  II.,  VIII.,  and  IX.  will  grasp 
fully  why  so  many  famous  men  are  born  of  undistinguished  parents,  but  will,  at  the 
same  time,  realise  the  overwhelming  advantage  of  coming  of  a  good  stock. 
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PROFESSOR  K.  PEARSON  AND  DR.  A.  LEE  ON 


NOTE. 

This  memoir  was  originally  presented  to  the  Society  on  August  5,  1899,  and  read  on  November  16, 
1899.  In  working  out  by  the  same  theory  the  coefficients  of  inheritance  for  Basset  Hounds,  Mr.  Leslie 
Bramley-Moore  discovered  that  the  method  adopted  was  not  exact  enough  in  its  process  of  propor- 
tioning. Accordingly,  with  the  assistance  of  Mr.  L.  N.  G.  Filon,  we  immensely  developed  the  theory,  so 
that  it  was  necessary  to  rewrite  the  theoretical  part  of  the  original  memoir.  This  has  been  carried  out  in 
Part  VIL  of  this  series.  The  present  memoir  consists  substantially  of  the  portions  of  the  original 
memoir  relating  to  the  inheritance  of  coat-colour  in  Horses  and  eye-colour  in  Man,  with  the  numerical 
details  and  the  resulting  conclusions  modified,  so  far  as  the  extended  theory  rendered  this  necessary.  In 
the  very  laborious  work  of  reconstructing  my  original  tables  I  have  received  the  greatest  possible  assistance- 
from  Dr.  Alice  Lee,  and  I  now  wish  to  associate  her  name  mth  mine  on  the  memoir.*  The  memoir 
was  at  my  request  returned  to  me  for  revision  after  it  had  been  accepted  for  the  'Philosophical 
Transactions.' 


Part  I. — Introductory. 

(1.)  A  CERTAIN  number  of  characters  in  living  forms  are  capable  of  easy  observation, 
and  thus  are  in  themselves  suitable  for  observation,  but  they  do  not  admit  of  an 
exact  quantitative  measurement,  or  only  admit  of  this  with  very  great  labour.  The 
object  of  the  present  paper  is  to  illustrate  a  method  by  which  the  correlation  of  such 
characters  may  be  effectively  dealt  with  in  a  considerable  number  of  cases.  The  con- 
ditions requisite  are  the  following  : — 

(i.)  The  characters  should  admit  of  a  quantitative  order,  although  it  may  he- 
impossible  to  give  a  numerical  value  to  the  character  in  any  individual. 

Thus  it  is  impossible  at  present  to  give  a  quantitative  value  to  a  brown,  a  bay,  or  a 
roan  horse,  but  it  is  not  impossible  to  put  them  in  order  of  relative  darkness  of  shade. 
Or,  again,  we  see  that  a  blue  eye  is  lighter  than  a  hazel  one,  although  we  cannot 
d  priori  determine  their  relative  ]30sitions  numerically  on  a  quantitative  scale. 

Even  in  the  markings  on  the  wings  of  butterflies  or  moths,  where  it  might  be 
indefinitely  laborious  to  count  the  scales,  some  half  dozen  or  dozen  specimens  may 
be  taken  to  fix  a  quantitative  order,  and  all  other  specimens  may  be  grouped  by 
inspection  in  the  intervals  so  determined. 

We  can  even  go  a  stage  further  and  group  men  or  beasts  into  simply  two- 
categories — light  and  dark,  tall  and  short,  dolichocephalic  and  brachycephalic — and 
so  we  might  ascertain  by  the  method  adopted  whether  there  is,  for  example,  correla- 
tion between  complexion  and  stature,  or  stature  and  cephalic  index. 

(ii.)  We  assume  that  the  characters  are  a  function  of  some  variable,  which,  if  we 

*  I  have  further  to  thank  Mr.  Leslie  Bramley-Moore,  Mr.  L.  N.  G.  Filon,  M.A.,  Mr.  W.  R.. 
Macdonell,  M.A.,  LL.D.  and  Miss  C.  D.  Fawcett,  B.Sc,  for  much  help  in  the  arithmetic,  often  for 
laborious  calculations  by  processes  and  on  tables,  which  were  none  the  less  of  service  if  they  were 
afterwards  discarded  for  others.  To  Mr.  Bramley-Moore  I  owe  the  extraction  and  part  of  the 
arithmetical  reduction  of  the  horse-colour  tables. 
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could  determine  a  quantitative  scale,  would  give  a  distribution  obeying — at  any  rate 
to  a  first  approximation — the  normal  law  of  frequency. 

The  whole  of  the  theoretical  investigations  are  given  in  a  separate  memoir,  in 
which  the  method  applied  is  illustrated  by  numerical  examples  taken  from  inheri- 
tance of  eye-colour  in  man,  of  coat-colour  in  horses  and  dogs,  and  from  other  fields. 
We  shall  not  therefore  in  this  paper  consider  the  processes  involved,  but  we  may 
make  one  or  two  remarks  on  the  justification  for  their  use.  If  we  take  a  problem 
like  that  of  coat-colour  in  horses,  it  is  by  no  means  difficult  to  construct  an  order  of 
intensity  of  shade.  The  variable  on  which  it  depends  may  be  the  amount  of  a 
certain  pigment  in  the  hair,  or  the  relative  amounts  of  two  pigments.  Much  the 
same  applies  to  eye-colour.  In  both  cases  we  may  fail  to  obtain  a  true  quantitative 
scale,  but  we  may  reasonably  argue  that,  if  we  could  find  the  quantity  of  pigment, 
we  should  be  able  to  form  a  continuous  curve  of  frequency.  We  make  the  assump- 
tion that  this  curve — to  at  any  rate  a  first  approximation — is  a  normal  curve.  Now 
if  we  take  any  line  parallel  to  the  axis  of  frequency  and  dividing  the  curve,  we 
divide  the  total  frequency  into  two  classes,  which,  so  long  as  there  is  a  quantitative 
order  of  tint  or  colour,  will  have  their  relative  frequency  unchanged,  however  we,  in 
our  ignorance  of  the  fundamental  variable,  distort  its  scale.  For  example,  if  we 
classify  horses  into  bay  and  darker,  chestnut  and  lighter,  we  have  a  division  which  is 
quite  independent  of  the  quantitative  range  we  may  give  to  black,  brown,  bay, 
chestnut,  roan,  grey,  &c. 

Precisely  the  same  thing  occurs  with  eye-colour ;  we  classify  into  brown  and  darker, 
hazel  and  lighter,  and  the  numbers  in  these  classes  will  not  change  with  the 
quantitative  scale  ultimately  given  to  the  various  eye-tints.  Our  problem  thus 
reduces  to  the  following  one  :  Given  two  classes  of  one  variable,  and  two  classes  of  a 
second  variable  correlated  with  it,  deduce  the  value  of  the  correlation.  Classify  sire 
and  foal  into  bay  and  darker,  chestnut  and  lighter  ;  mother  and  daughter  into  brown 
and  darker,  hazel  and  lighter,  and  then  find  the  correlation  due  to  inheritance 
between  the  coat-colour  or  eye-colour  of  these  pairs  of  relations.  The  method  of 
doing  this  is  given  in  Memoir  VII.  of  this  series.  Its  legitimacy  depends  on  the 
assumptions  (i.)  and  (ii.)  made  above,  which  may  I  think  be  looked  upon  as 
justifiable  approximations  to  the  truth. 

Of  course  the  probable  error  of  the  method  is  larger  than  we  find  it  to  be  when  cor- 
relation is  determined  from  the  product-moment.  Its  value  varies  with  the  inequality 
of  the  frequency  in  the  two  classes  given  by  the  arbitrary  division.  It  will  be 
least  when  we  make  that  frequency  as  nearly  equal  as  possible— a  result  which  can 
often  be  approximately  reached  by  a  proper  classification.  In  our  present  data  the 
probable  errors  vary  from  about  "02  to  -04,  values  which  by  no  means  hinder  us  from 
drawing  general  conclusions,  and  which  allow  of  quite  satisfactory  general  results. 

(2.)  So  far  we  have  only  spoken  of  the  two  classes,  which  are  necessary  if  we 
merely  want  to  determine  the  correlation.     But  if  we  wish  to  deal  with  relative 
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variability  we  must  have  more  than  two  classes.  We  have,  in  fact,  in  our  tables 
preserved  Mr.  Galton's  eight  eye- colour  classes  and  the  seventeen  classes  under  which 
the  coat-colour  of  thoroughbred  horses  is  classified  in  Wethekby's  studbooks.  Such  a 
classification  enables  us  at  any  rate  approximately  to  ascertain  relative  variability, 
and,  what  is  more,  to  reconstruct  approximately  the  quantitative  scale  according  to 
which  the  tints  must  be  distributed  in  order  that  the  frequency  should  be  normal. 
For,  in  order  to  attain  this  result,  we  have  to  ascertain  from  a  table  of  the  areas  of 
the  normal  curve  the  ratio  of  the  length  of  the  abscissa  to  the  standard  deviation  which 
corresponds  to  any  given  increase  of  frequency.  Let  us  suppose  that  three  classes 
have  been  made — Jij,  7i,,  n.^,  represented  by  the  areas  of  the  normal  curve  in  the 
accompanying  diagram  so  marked.     Let      and     be  the  distances  of  the  mean  from 


/ 


0  (3!^ 

the  two  boundaries  of  rig-  Here  p)^  may  be  negative,  or  p^  infinite,  &c. 
A  J  z=  jp^ja-,      =  i^g/o",  we  find  at  once,  if  N  =  total  frequency, 


Then  if 


N 

71^  +  '/ij  ~ 

N 


'dx 


(i.). 
(ii.). 


Now  the  integrals  on  the  right  are  tabulated,  and  thus,  since  the  left-hand  side  is 
a  known  numerical  quantity,  it  follows  that  Pi/a-  and  Pg/cr,  and  accordingly  the  range 
(Ps  — Pi)/^  of       class  in  terms  of  the  standard  deviation,  are  fully  determined. 

Thus,  if  e  be  the  range  on  the  scale  of  tint  or  colour  of  the  group  of  which  the 
observed  frequency  is  n^,  we  have  e  =  p^  —  p^,  and  thus  e/'cr  =  q  say,  is  known. 
For  a  second  series  e/cr'  =  q'.  Hence  o-/o-'  =  q'/q,  and  accordingly  the  ratio  of  the 
variabilities  of  the  two  series  is  determined. 

Again,  the  ratio  Pil{p-^  —  Pi)  enables  us  to  find  the  position  of  the  mean  in  terms 
of  the  range  on  the  scale  occupied  by  the  tint  corresponding  to  the  frequency  n^. 
As  a  rule  we  shall  take  this  tint  to  be  that  in  which  the  mean  actually  lies,  in  which 
case  we  shall  have  pj{pz  +  Pi)  as  determining  the  ratio  in  which  the  mean  divides 
the  true  quantitative  range  of  this  particular  tint. 

(3.)  Let  -n  =  Px{Pz-Pi)  =  Ih/iK-h)    '  (iii-). 

^  =  <x/o-'  =  (V-V)/(A3-A,)  (iv.). 

It  remains  to  find  the  probable  errors  of  these  quantities. 
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Suppose      to  be  the  standard  deviation  for  the  errors  in  a  quantity  x,  and 
the  correlation  coefficient  for  errors  in  two  quantities  x  and  y. 
Further  let 

where  subscripts  and  dashes  may  be  attached  to  H  to  correspond  to  hke  distin- 
guishing marks  attached  to  h. 
Since 

we  have  at  once  Swj  =  NHjS/ii, 

and  ■  tj.,  =  W(NHi)  (vii.). 

Similarly,  871^^  =  —  NHgS/ig,  whence  : 

t,^  =  tJ(NR,)  (viii.). 

Further,  we  have  =  —  t,t^%i^H,J(^^^iH.s)  (^^O  ' 

but,  as  is  shewn  in  Part  VII. ,  §  4, 

2  _  V  2  —  %(^-^3)  \ 

-^"1  —       X       '         ^"3  —       N   v^-/' 

tn,t„B.„^.^  =   (Xl.). 


It 

Thus  we  find 


Probable  error  of     =  '67449^/,^ 

_  -67449  1      A(N-^,)  .  ..  . 

-  v/N  H,  V  — m —  ^''''•>'- 

Probable  error  of     =  \/ '''^^j^7'-  {y^\\\.). 

Correlation  in  errors  in     and  ^3,  or  R/^/^j,  is  given  by 

th'^h'^k.h,  =  j^3g|^;  (xiv.). 

Let  w  =  /ig  —  u  =  /z-g'  —  }i(  be  the  ratio  to  the  respective  standard  deviations 
of  the  ranges  corresponding  to  the  groups  ng  and  n^'.  Then 


W\  +  H1H3J  ' 


whence,  if    be  a  proportional  frequency  =  ^^/N,  we  readily  find 

-Probableerrorof^.=  +  .    ,    .  (xv.). 

Probableerrorof..=  -^-|  — +  .    .  (xvi.). 

M  2 
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I  now  proceed  to  determine  the  correlation  in  the  errors  made  in  determining  the 
ranges  corresponding  to  any  two  classes  of  any  two  variables  which  are  correlated. 
For  this  purpose  let  the  frequency  correlation  table  be  dressed  as  follows,  in  the 


diagram  below. 


Axis  of  PC  variAbLe.    o  x, 


ToC(a,Ls 


m, 


m 


'12 


m 


n. 


m 


m 


m 


'23 


m 


'St 


m 


'32 


m 


'S3 


ToCclLs. 


n 


ni 


/7 


Here  mij  denotes  the  frequency  of  individuals  common  to  the  two  classes  ra  and 
n'j.  Let  My  denote  its  "  conjugate,"  or  all  the  frequency  which  appears  in  neither  Ui 
nor  n'j ;  then 

N  =  My  +  ni  +  n'j  —  m;j  (xvii.). 

As  before,  we  have 


t  2  = 


^  3  _ 


?;,;(N  —  H;) 

N  ' 

M;,(N-M,;;) 

N 


—i/i'i  — 


N 


(xviii.). 
(xix.). 


Further,  since  mij  and  My  are  nmtually  exclusive,  we  have 

From  (xvii.)  we  have  for  small  variations 

S7ii  +  Sn'j  =  Sniij  —  SMy. 


Hence 


(XX.). 


(xxi). 


MATHEMATICAL  CONTRIBUTIONS  TO  THE  THEORY  OF  EVOLUTION.  85 
Substituting  the  values  given  above  we  find,  after  some  reductions, 

This  result,  which  is  extremely  simple  in  form,  gives  the  correlation  in  errors  made 
in  determining  the  frequencies  in  any  two  classes  whatever  of  any  two  correlated 
variables. 

I  next  proceed  to  find  the  correlation  between  errors  in  u  and  u\  the  ratio  of  the 
ranges  occupied  by  any  two  classes  to  their  respective  standard  deviations. 
We  have 

8^2  +  Sng  =  -  NH^SAj  ; 

=     —  NHgS/lg. 

Hence  -  K)  =  J&  +  ^^(w  " 


NHi  '    N  VHj  Hg 
Similarly  8(^3'  -  h/)  =         +  ^"n"  (l^  ~ 

Multiply  the  first  by  the  second,  and  summing  as  usual  for  all  possible  errors,  we 
have,  by  using  (xxii.) 

1   f  Nm^j  —  n^iici^'      N?»23  —  '>^2>^3  /  I  1  \  ■   Nmgj  —  n^'n^  /  1 


Nmgg  —  n^n./ 


Hi      Hg)(H/  Hg')} 


or 


Collecting  the  like  H's  we  find,  after  very  considerable  reductions, 

_  _L  f  Nmii  I  ^^^33  -  ^3^3'   ,  N7>iig  -  n^7i^'      Nmg^  -  n^n^'] 

""'       N  1     N^HiH/  N^HgHg'  N^H^Hg'  N^HgH/  J 

.    ,    .    .  (xxiii.), 

,     i„2,„ix„„,  _  ^  I  +  +     ^^^^     +  I    .    .  (xxni.  ). 


where  (Xij  =  Wy/N  =  proportional  frequency. 

A  glance  at  our  diagram  on  the  pi'evious  page  of  the  correlation  table  divided  into 
nine  classes,  shows  at  once  the  symmetrical  formation  of  this  result.  By  writing  at 
the  points  P,  Q,  S,  and  T,  the  ordinate  there  of  the  normal  surface,  on  the  supposition 
of  no  correlation  and  N  =  1,  the  construction  of  the  result  is  still  more  clearly 
brought  out. 

We  are  now  in  a  position  to  determine  the  probable  errors  of  17  and  4.    We  have 
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Hence 


_  h^%.i  r     (N  -  n^)       )u  (N  -  w.,)  _         'In^ii.^  | 


Or,  Probable  error  of  rj 


67449  /ti/tg  [  /  V 


where  u  is  the  range  —  h^,  and  and  1^3  are  the  proportional  frequencies,  as  before. 
Care  must  be  taken,  if  the  class  cover,  as  it  usually  will  in  our  present  investiga- 
tions, the  mean,  to  put  hi  negative  within  the  radical.  In  other  words,  for  a  class 
covering  the  mean  we  have 


Probable  error  of  17 

•67449     h,h,      \    V,  V,  jj^  ^yii 


Lastly  we  have 
or, 

Hence 


i  =  u'ju  , 

U   \  III  Vj 


v  = 

Thus  :   Probable  error  of  C 


=  •67449^1^4-^,  -~^^}  (xxvi.), 


where  we  have  by  (xv.),  (xvi.),  and  (xxiii.'''') 


^  ,3  — 


-ir/  /  //  /\2l 


~  N  1    HiH/  H3H3'  H1H3'  H3H/ 

where,  as  before,  /i-'s  and  vs,  represent  proportional  frequencies. 

In  the  following  investigations  on  coat-colour  and  eye-colour  inheritance  I  have 
not  thought  it  needful  to  give  in  every  one  of  the  thirty-six  relationships  dealt  with 
the  probable  errors  of  the  means,  ratio  of  variabilities,  and  the  coefficients  of  inheri- 
tance      ^,  and  r).      The  arithmetical  labour  would  have  been  too  great,  for  the 
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expressions  as  given  above  are  somewhat  complex.  It  is,  however,  necessary  to  keep 
the  approximate  values  of  these  probable  errors  in  view,  and,  as  our  results  classify 
themselves  easily  into  groups  for  which  our  data,  as  well  as  the  intensity  of  heredity, 
are  approximately  the  same,  one  series  of  these  errors  has  been  found  for  each  group. 

(4.)  If  we  have  ground  for  our  assumption  that  the  variable  at  the  basis  of  our 
tint  classification  can  be  so  selected  as  to  give  a  normal  distribution,  we  may  deter- 
mine the  relative  lengths  on  the  scale  of  that  variable  occupied  by  each  tint  or  shade. 

Thus  if  o-^  be  the  standard  deviation  of  the  variable  for  male  eye-colour,  0-3  for 
female  eye-colour,  I  measured  the  range  on  the  scale  in  terms  of  and  a.  for 
Mr.  Galton's  eight  eye-colour  tints  for  3000  cases  of  male  and  3000  cases  of  female 
eye-colour.    I  found  the  spaces  occupied  on  the  unknown  scale  to  be  as  follows  : — 


No. 

Tint. 

Range  in  tei'ms  of  0-0. 

Range  in  terms  of  o-j. 

1 

Light  blue  .    •    .    .  . 

00 

00 

2 

Blue,  dark  blue    .    .  . 

1-39276 

1-34918 

3 

Grey,  blue-green  .    .  . 

•73468 

•77596 

4 

Dark  grey,  hazel  .    .  . 

•40027 

■41992 

5 

Light  brown  .... 

•03893 

•00856 

6 

Brown  

•43679 

•35895 

7 

Dark  brown  

•84161 

•64167 

8 

Very  dark  brown,  black . 

00 

00 

These  results  are  not  so  regular  as  we  might  have  hoped  for,  on  the  assumption 
that  the  ratio  of  crjcr^  would  be  the  same  from  whatever  part  of  the  scale  it  be 
determined.  The  general  conclusion,  however,  would  be  that  o-^  is  slightly  larger 
than  o-^,  which  is  confirmed  by  other  investigations.  Actually  a  tint  may  be  rather 
vaguely  described,  and  where  the  data  were  obtained  by  untrained  observers  without 
the  assistance  of  a  plate  of  eye-colours,  a  good  deal  of  rather  rough  classification  is 
likely  to  have  taken  place.  I  do  not  think  it  would  be  safe  to  go  further  than  stating 
that  on  the  quantitative  colour  scale  the  tints  as  described  must  occupy  spaces  in 
about  the  following  proportions  : — 


Light 
Blue. 

Blue, 
Dark  Blue. 

Grey, 
Blue-Green. 

Dark  Grey, 
Hazel. 

Light 
Brown. 

Brown. 

Dark 
Brown. 

Very  dark  Brown, 
Black. 

00 

1^37 

•75 

•41 

•02 

•40 

•74 

00 

Taking  2000  colts  and  2000  fillies,  the  standard  deviations  being  ctj,  and  o-..  respec- 
tively, I  have  worked  out  the  coat-colour  ranges  in  terms  of  cto  and  ctj  for  each  of 
the  sixteen  colours*  occurring  in  the  records.    We  have  the  following  results  : — 

*  See  p.  92,  below, 
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1  lilt. 

Range  in  o-o. 

Range  in  o-j. 

Range  in  o-^. 

Range  in  o-^. 

1 

00 

00 

9 

•00000 

•00000 

2 

•12683 

•10768 

10 

1-96956 

3 

•00000 

•03313 

11 

•00000 

•00000 

4 

•91747 

1-11055 

12 

•02490 

•00000 

5 

•00000 

•00352 

13 

•00000 

•00000 

6 

•11059 

•10451 

14 

•00000 

•00000 

7 

1^34684 

1^27688 

15 

-00000 

•00000 

8 

•00000 

•00000 

16 

00 

00 

Here  again  it  seems  to  me  that  the  most  we  can  safely  do  is  to  consider  that  on  a 
suitable  scale  the  relative  lengths  occupied  by  the  classes  of  coat-colours  recognised 
by  thoroughbred  horse  breeders  would  be  somewhat  as  folloM^s  :— 


bl. 

bl./br. 

br./bl. 

br. 

br./b. 

b./br. 

b. 

b./cli. 

ch./b. 

ch. 

cb./ro. 

ro./cli. 

ro. 

ro  /gr. 

gr./ro. 

gr- 

00 

•12 

•02 

•00 

•11 

1^31 

•00 

■00 

1^99 

•00 
• 

•01 

•00 

•00 

•00 

00 

The  reader  must  carefully  bear  in  mind  that  these  represent  scale-lengths  occupied 
by  the  coat-colour  and  not  the  frequency  of  horses  of  these  individual  coat-colours. 
What  we  are  to  understand  is  this  :  that  if  eye-colour  in  man  and  coat-colour  in 
horses  were  measured  on  such  quantitative  scales  as  we  have  given  in  skeleton,  then 
the  distribution  of  the  frequency  of  the  several  colours  would  be  very  approximately 
normal.  The  actual  skeleton  scales  are  represented  in  the  accompanying  diagram, 
which  puts  them  at  once  before  the  eye. 


br/U. 


Normal  Scale      Colour  Ranges  in  Thoroughbred  Horses, 
b/br  ch/b. 


rojch 


Black 


Brown 


Bay 


Chestnut 


'hite 


ELjhn 


brib. 


Normal  Scale  of  Eye  Colour  Ranges  in  Man. 


chlroy\%i^r. 
y^r/h. 


Verij  Dark 


Dark  Brown 

Brown 

Hd-zeb 
OdrkGrey 

Grey 
Blue -Green 

Dark  blue. 
Blue 


Li^ht 
Blue 


U^tBrowh 

(5.)  It  is  necessary  here  to  draw  attention  to  a  distinction  of  some  importance  in 
heredity,  namely,  that  between  blended  and  exclu.sive  inheritance.  In  my  treatment 
of  the  law  of  ancestral  heredity,*  it  is  assumed  that  we  have  to  deal  with  a  quanti- 
tatively measurable  character,  and  that  the  ancestry  contribute  to  the  quantity  of 
this  character  in  certain  proportions  which  on  the  average  are  fixed  and  follow  certain 
definite  numerical  laws.    Such  an  inheritance  is  blended  inheritance.    But  another 


■  Roy.  Soc.  Proc.,'  vol.  62,  p.  386. 
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type  of  inheritance  is  possible.  We  may  have  one  progenitor,  prepotent  over  all  others 
and  absorbing  all  their  shares,  who  hands  down  to  the  offspring  not  a  proportion  of 
his  character,  but  the  whole  of  it  without  blend.  If  this  progenitor  is  a  parent  we 
have  exclusive  inheritance,  if  a  higher  ancestor  a  case  of  reversion.  I  have  dealt  at 
some  length  with  this  type  of  inheritance  under  the  title  of  the  Law  of  Reversion 
in  another  paper.*  We  must  consider  in  outline  the  main  features  of  such  inheritance, 
for  the  cases  of  eye-colour  in  man  and  coat-colour  in  the  horse  approximate  more  closely 
to  the  numerical  values  required  by  it,  than  to  those  indicated  by  the  law  of  ancestral 
heredity.  The  chief  feature  of  exclusive  inheritance  is  the  absolute  prepotency  of 
one  parent  with  regard  to  some  organ  or  character.  It  need  not  always  be  the 
parent  of  the  same  sex,  or  the  same  parent  throughout  the  same  family.  Some 
offspring  may  take  absolutely  after  one,  others  after  another  parent  for  this  or  that 
organ  or  character  only.  I  believe  Mr.  Galton  first  drew  attention,  in  his  '  Natural 
Inheritance'  (p.  139),  to  this  exclusive  or,  as  he  terms  it,  alternative  heritage 
in  eye-colour.  In  going  through  his  data  again  I  have  been  extremely  impressed  by 
it ;  even  those  cases  in  which  children  might  be  described  as  a  blend,  rare  as  they  are, 
are  quite  possibly  the  result  of  reversion  rather  than  blending.  If  we  suppose  exclu- 
sive inheritance  to  be  absolute,  and  there  to  be  no  blending  or  reversion,  it  is  not  hard 
to  determine  the  laws  of  inheritance.  Supposing  the  population  stable,  one-half  the 
offspring  of  parentages  with  one  parent  of  given  eye-colour  would  be  identical  with 
that  parent  in  eye-colour,  the  other  half  would  regress  to  the  general  population 
mean,  i.e.,  the  mean  eye-colour  of  all  parents.  Hence,  taken  as  a  whole,  the  regression 
of  children  on  the  parent  would  be  "5.  In  the  case  of  the  grandparent  the  regression 
would  be  "25  ;  of  a  great  grandparent  '125,  and  so  on.  With  an  uncle  a  quarter  of 
the  offspring  of  his  brother  will  be  identical  in  eye-colour  with  him,  the  other  three- 
quarters  will  regress  to  the  population  mean,  thus  the  regression  will  be  '25.  If  we 
have  n  brethren  in  a  family,  and  take  all  possible  pairs  of  fraternal  relations  out  of  it, 
there  will  be  \n{ii  —  1)  such  pairs ;  brothers  will  have  the  same  eye-colour  that  of 
one  parent,  the  other  \n  brother  that  of  the  other  parent.  Hence  selecting  any  one 
brother,  -|n  —  1  would  have  his  eye-colour,  and  on  the  average  would  have 
regressed  to  the  mean  of  the  general  population.  In  other  words,  the  coefficient  of 
regression  would  be  {\n  —  l)J{\n  —  1      ^n)  =  {\n  —  l)j{n  —  1). 


Accordingly  n  =  3  Regression  =  -25 

w  =  4  „  =  -3333 

n  =  47  „  =  -3649 

n  =  b  „  =  -375 

n  —  5-3  „  =  -3833 

n  —  6  ,,  =  '4 

11  —  00  ,,  =  '5 


*  'Roy.  Soc.  Pioc.,'  vol.  (56,  pp.  140  d  .^e)/. 
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It  will  at  first  appear,  therefore,  that  the  fraternal  regression  with  the  size  of 
families  actually  occurring  will  vary  from  '35  to  '4. 

To  some  extent  these  values  would  be  modified  by  assortative  mating,  which 
actually  exists  in  the  case  of  eye-colour.  The  correlations  between  parent  and 
offspring  and  between  brothers  would  both  be  slightly  increased.  Thus  if  p  be  the 
coefficient  of  assortative  mating,  Ry  the  fraternal  correlation  with  and  Vf  without 
assortative  mating,  and  r  the  coefficient  for  parent  and  offspring,* 

%  -   1  _  2^  • 
If  we  put  Vf  =  -36,  r  =  -5,  p  =  '1,  we  find 

Ry  =  -39. 

Thus  we  see  that  the  regression  or  correlation  for  fraternal  inheritance  in  the  case 
of  exclusive  inheritance  could  not,  with  the  average  size  of  families,  be  very  far  from 
•4  of  blended  inheritance. 

A  further  source  which  can  modify  immensely,  however,  the  fraternal  correlation  is 
the  prepotency  of  one  or  other  parent,  not  universally,  but  within  the  individual  family. 
In  the  extreme  case  all  the  offspring  might  be  alike  in  each  individual  family.  Thus 
fraternal  correlation  might  be  perfect  although  parental  correlation  were  no  greater 
than  "5.  Hence,  where  for  small  families  we  get  a  fraternal  correlation  greater  than 
'4  to  "5,  it  is  highly  probable  that  there  exists  either  a  sex  prepotency  (in  this  case, 
one  of  the  parental  correlations  will  be  considerably  greater  than  the  other)  or  an 
individual  prepotency  (in  which  case  the  parental  correlations  based  on  the  average 
may  be  equal).  We  shall  see  that  fraternal  correlations  occur  greater  than  '5  in  our 
present  investigations.  I  have  dealt  with  these  points  in  my  Memoir  on  the  '  Law  of 
Reversion,'!  and  also  in  the  second  edition  of  the  '  Grammar  of  Science. '| 

Another  point  also  deserves  notice,  namely,  that  with  the  series  '5,  "25,  '125,  &c., 
for  the  ancestral  coefficients  in  the  direct  line,  the  theorems  proved  in  my  Memoir  on 
Regression,  Heredity,  and  Panmixia§  for  the  series  of  coefficients  r,  r^,  .  .  .  exactly 
hold.  That  is  to  say,  if  we  have  absolutely  exclusive  inheritance,  the  partial  regres- 
sion coefficients  for  direct  ancestry  are  all  zero  except  in  the  case  of  the  j^arents. 
This  it  will  be  observed  is  not  in  agreement  with  Mr,  Galton's  views  as  expressed  in 
Chapter  VIII.  of  the  '  Natural  Inheritance.'  But  I  do  not  see  how  it  is  possible  to 
treat  exclusive  inheritance  on  the  hypothesis  that  the  parental  regression  is  about  "3.11 
Actual  investigation  shows  that  for  this  very  character,  i.e.,  eye-colour,  it  is  nearer  '5. 
If  we  take  Table  XIX.  of  Mr.  Galton's  appendix,  and  make  the  following  groups,  both 

■*  This  is  shown  in  a  paper  not  yet  published  on  the  influence  of  selection  on  correlation. 
+  'Roy.  Soc.  Proc.,'  vol.  66,  pp.  140  et  seq.. 

X  "  On  Prepotency,"  p.  459  ;  "  On  Exclusive  Inheritance,"  p.  486. 

§  'Phil.  Trans.,'  A,  vol.  187,  p.  302,  etc. 

II  Mr.  Galton  takes  .'.  throughout  his  arithmetic. 


MATHEMATICAL  CONTRIBUTIONS  TO  THE  THEORY  OF  EVOLUTION.  91 

parents  light,  one  parent  light  and  one  medium,  one  light  and  one  dark,  we  reach  the 
following  results  : — 


Children,  actual. 

Light-eyed  children,  calculated. 

Parents'  eye-colour. 

Total. 

Light-eyed. 

Exclusive 
inheritance. 

Ancestral  law  with 
knowledge  of  parents 
and  grandparents. 

Both  light .... 
Light  and  medium  . 
Light  and  dark  .  . 

355 
215 
211 

334 
170 
107 

355 
161 
105 

321 
160 
117 

Here  the  exclusive  inheritance  leads  us  to  misplace  thirty-two  and  the  ancestral 
law  thirty-three  children.  The  evidence,  therefore,  of  the  correctness  of  the  latter  is 
hardly  greater  than  that  of  the  former.  Indeed,  if  the  former  were  modified  for 
reversion,  it  would  very  possibly  give  better  results  than  the  latter, 

I  am  inclined  accordingly  to  look  upon  eye-colour  inheritance  as  an  exclusive 
inheritance  modified  by  reversion,  and,  to  some  extent,  by  assortative  mating,  rather 
than  a  mixture  of  exclusive  inheritance  with  a  slight  amount  of  blending.  In  either 
case  exclusive  inheritance  gives  results  like  the  above  so  closely  in  accord  with  the 
ancestral  law  that  the  latter  might  be  supposed  to  hold.  But,  theoretically,  I  do  not 
understand  how  the  ancestral  law  is  compatible  with  exclusive  inheritance. 
Theoretically,  we  have  values  of  parental,  avuncular,  and  grand-parental  correlation 
greater  than  the  ancestral  law  would  permit  of,  and  these  theoretical  values  are,  on 
the  whole,  closer  to  observation,  as  we  shall  see  in  the  sequel,  than  those  given  by  the 
law  of  ancestral  heredity.    The  following  table  gives  the  two  systems  : — 


Table  I. — Theoretical  Values  of  the  Regression  Coefficients. 


Relationship. 

Blended  inheritance, 
ancestral  law. 

Exckxsive  inheritance, 
absolute,  no  reversion. 

Parent  and  offspring  ..... 
Grandparent  and  offspring  .  .  . 
Great-grandparent  and  offspring  . 

•3 

•15 
■075 
•4 
•15 

•5 
•25 
•125 
•35  to  -5* 
•25 

Now,  if  exclusive  inheritance  be  modified  by  reversion  or  assortative  mating,  or  if 
blended  inheritance  be  modified  by  "  taxation,"!  then  we  shall  get  values  different 

*  This  varies  with  the  size  of  the  family, 
t  '  Roy.  Soc.  Proc.,'  vol.  62,  p.  402, 
N  2 
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from  tlie  above,  and  possibly  filling  up  the  numerical  gap  between  them.  To  this 
point  I  shall  return  when  dealing  witli  tlie  observed  values  for  eye-colour  in  man, 

Paet  II. — On  Colour-Inheritance  in  Thoroughbred  Racehorses. 

(6.)  All  the  data  were  extracted  from  Weatherby's  stud-books,  the  colours  bein^ 
those  of  the  horses  as  yearlings.  My  first  step  was  to  form  an  order,  not  a  quantita- 
tive scale,  of  horse-colours.  With  this  end  in  view,  the  recorded  colours  were 
examined,  and,  including  the  arabs,  the  following  seventeen  colours  were  at  first 
found  : — 


1.  Black  (bl). 

2.  Black  or  brown  (bl./br.). 

3.  Brown  or  black  (br./bl.). 

4.  Brown  (br.). 

5.  Brown  or  bay  (br./b.). 

6.  Bay  or  brown  (b./br.). 

7.  Bay  (b.). 

8.  Bay  or  chestnut  (b./ch.). 

9.  Chestnut  or  bay  (ch./b.). 


10.  Chestnut  (ch.). 

11.  Chestnut  or  roan  (ch./ro.). 

12.  Roan  or  chestnut  (ro,/ch.). 

13.  Roan  (re). 

14.  Roan  or  grey  (ro./g.). 

15.  Grey  or  roan  (g./ro,). 

16.  Grey  (g.). 

17.  White  (w.). 


Now,  if  we  take  the  alternative  colours  to  mean  that  the  first  alternative  is  the 
prominent  element,  we  see  that  these  colours  in  use  among  breeders  admit  of  only  one 
arrangement  from  Mack  to  ivhite.  That  is  to  say,  that  a  continuous  shade-change  is 
actually  in  use  for  the  colour-nomenclature  of  thoroughbred  horses,*  Thus  without 
any  hypothesis  as  to  the  quantitative  relative  values  of  bay  or  roan,  we  have  an  order 
which  serves  for  all  our  present  purposes.  Following  this  order.  Appendix  I.,  Tables 
I. — XII.,  for  the  colour  correlation  of  related  pairs  of  horses  was  compiled  by 
Mr.  Leslie  Bramley-Moore  from  the  stud-books.  When  dealing  with  relationship 
m  the  ?  line  only,  no  weight  has  been  given  to  fertility,  as  each  mare  has  had  only 
one  foal  attributed  to  it,  or  two  in  the  case  of  fraternal  correlation.  In  the  case 
of  the  (?  line,  the  colours  of  the  older  sires  were  harder  to  ascertain,  and  we  did  not 
obtain  altogether  more  than  600  sire-colours.  Thus  one,  two,  or,  in  a  few  cases,  three 
or  four  colts  or  fillies  were  taken  from  each  sire, 

I  shall  now  discuss  the  results  which  may  be  drawn  from  these  tables  for  the  theory 
of  heredity,  first  placing  in  a  single  table  all  the  numerical  constants  calculated  from 
the  data  in  Tables  I.  to  XII.  of  Appendix  I. 


*  Among  the  6000-8000  horses  dealt  with  only  four  were  found  with  colours  not  entered  in  this  scale, 
but  these  entries  of  bl./ch.,  br./ch.,  b./ro.,  in  no  way  contradict  the  order  of  the  scale,  but  merely  show  a 
rougher  appreciation  on  the  part  of  the  nomenclator,  or  possibly  printers'  or  editor's  errors. 
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Table  II.— Coat-colour  Inheritance  in  Thoroughbred  Horses. 


Pair  of  relatives. 

Division  of 
bay  range  by 
the  naean 

Ratio  of 
variabilities. 

Coeffi- 
cients of 
correla- 
tion. 

Coefficients  of 
regression. 

Num- 
ber of 
cases. 

X. 

i  = 

i  = 

-J" 

Ra'.i/. 

Ri/.i;. 

N. 

Sire  

Colt.  . 
Filly  . 
Colt.  . 
Filly  . 

•6111 
•6061 
•5359 
•5565 

•5713 
•5719 
•6027 
•6051 

•8712 
•8298 
•9500 
•9036 

M478 
r2051 
1^0526 
M067 

•4913 
•5422 
•4862 
•5668 

•4280 
•4499 
•4619 
•5122 

■5639 
•6534 
•5118 
•6273 

1300 
1050 
1000 
1000 

Maternal  grandsire 
Maternal  grandsire 

Colt.  . 
Filly  . 

•6583 
•6359 

•5867 
•6042 

•7678 

1^3024 

•3590 
•3116 

•2392 

•4058 

1000 
1000 

Colt  

(Half 

Colt  

(mole 

Filly  

(Half 

Filly  

(Whole 

Filly  

(Half 

Filly  

(Whole 

Colt.  . 
siblings) 

Colt.  . 
siblings) 

Filly  . 
siblings) 

Filly  . 
siblings) 

Colt.  . 
siblings) 

Colt.  . 
siblings) 

•5908 
■5620 
•5665 
•5684 
•5633 
•5410 

•5908 
•5620 
•5665 
•5684 
•5865 
•5711 

1 
1 
1 
1 

•9607 
•9555 

1 
1 
1 
1 

r0409 
1^0466 

•3551 
•6232 
•4265 
•6928 
■2834 
•5827 

•3551 
•6232 
•4265 
•6928 
•2723 
■5568 

•3551 
•6^32 
•4265 
•6928 
•2950 
1^0466 

2000 
2000 
2000 
2000 
1000 
1000 

In  this  table  R^^^  =  '}\,yO;r/ (t,^,  Ry.i.  =  rjy(T,J<T.r.  Half-siblings''^  are  those  having  the 
same  dam,  but  different  sires.  Further,  r)  is  measured  from  the  brown  end  of  the  bay 
range  up  to  the  mean. 

(7.)  On  the  Mean  Coat-Colour  of  Horses. — If  our  theory  be  correct,  this  colour 
will  not  differ  much  from  the  median  colour,  and  we  notice  at  once  a  secular  change 
going  on.    We  have  the  following  order  :  — 


Maternal  grandsire  of  colt   =  -6583 

Maternal  grandsire  of  filly   =  -6359 

Sire  of  colt   =  -6111 

Sire  of  filly   =  -6061 

Colt  (mean  value  of  seven  series)  .    .    .  =  -5816 

Dam  of  colts   rj  =  "5359 

Dam  of  fillies   =-5565 

Fillies  (mean  value  of  seven  series)  .    .  =  -5753 


*  I  have  introduced  this  expression  in  my  paper  on  "  The  Law  of  Reversion,"  '  Roy.  Soc.  Proc.,'  vol.  66, 
as  a  convenient  expression  for  a  pair  of  offspring  from  same  parents  whatever  be  their  sex. 
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Now  the  colours  of  all  the  horses  are  returned  when  they  are  foals,  so  that  there  is 
no  question  of  any  variation  of  colour  with  age,  yet  we  notice  that — 

(i.)  The  horse  is  lighter  in  colour  than  the  mare. 

(ii.)  If  we  go  back  two  generations  (grandsire)  the  horse  is  lighter  than  if  we  only 
go  back  one  generation  (sire),  and  the  sires  are  again  lighter  than  their  colts. 
In  other  words,  there  seems  a  progressive  change  towards  a  darker  coat. 

(iii.)  On  the  other  hand,  the  mares  one  generation  back  appear  to  be  darker  than 
their  daughters. 

(iv.)  The  average  sire  of  colts  is  lighter  than  the  average  sire  of  fillies  ;  the  average 
dam  of  colts  is  darker  than  the  average  dam  of  fillies. 

Now  these  conclusions  seem  to  indicate  that  the  older  horse  was  lighter  in  coat,  and 
the  older  mare  darker  in  coat  than  either  the  colt  or  filly  of  to-day,  and  that  there  is  a 
tendency  in  the  thoroughbred  racehorse  of  to-day  to  approach  to  an  equality  of  colour 
in  the  two  sexes,  an  equality  which  is  a  blend  of  the  sensibly  divergent  sex-colour  of 
the  older  generation. 

Whether  this  secular  change  is  due  to  the  "  breeding  out "  of  the  influence  of  light 
Arabian  sires,  or  to  a  tendency  in  the  past  to  select  light-coloured  sires  and  dark- 
coloured  mares  for  breeding,  or  to  the  fact  that  such  coloured  sires  and  mares  are  the 
most  fertile,  i.e.,  to  an  indirect  effect  of  reproductive  selection,  is  not  so  easy  to 
determine.  But  what  does  appear  certain  is  that  the  average  thoroughbred  is 
approaching  to  a  blend  between  its  male  and  female  ancestry,  which  were  sensibly 
divergent."* 

(8.)  On  the  Relative  Variability  of  Sex  and  Class  in  Horses.—  The  following  table 
gives  the  length  of  the  bay  range  in  terms  of  the  standard  deviation  for  each  class. 
If  e  represent  this  range,  then  in  terms  of  the  previous  notation  e  —  itX  a  -—  u'  X  <r', 
and  from  these  values  of  u  and  u'  the  ratio,  C  =  or  J  a'  of  Table  II.  was  calculated. 


*  Mean  of  dams  and  sires  of  colts  =  -5735,  i.e.,  +  -5359). 

Mean  of  dams  and  sires  of  fillies  =  -5813,  i.e.,  i(-6061  +  -5565). 
These  are  curiously  enough  almost  exactly  equal  to  the  mean  values  -5753  and  -5816  obtained  for  fillies 
and  colts.    This  nmrse  I'elationship  is  too  close  to  the  probable  errors  of  the  quantities  under  investiga- 
tion for  real  stress  to  be  laid  on  it,  lint  it  may  still  turn  out  to  be  sixggestive. 
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III. — Table  of  Bay  Ranges. 


Relative  Pair. 

Bay  Range. 

Probable  Error  of  Median. 

X. 

y- 

u'  X  a-y. 

X. 

y- 

Dam  

Colt    .  .  .  .* 

Filly  .... 
Colt  .... 
Filly  .... 

1  •46943(7-^ 
1-64075(7^ 
l-36645cr^ 
1 -38165(7^ 

l-28019tr,/ 
l-36149(r;, 
l-29819a-y 
1-248450-,, 

-(-•0160 
+  •0159 
+  •0196 
±•0193 

+  ^0183 
+  ^0192 
+  ^0206 
±•0214 

Maternal  grandsire  . 
Maternal  grandsire  . 

Colt  .... 
Filly  .... 

l-69694cr^ 
l-65021cr^ 

1 -1929307,, 
1 -267020-;, 

±•0158 
±•0162 

+  ^0224 
±•0211 

Colt  

(Half 

Colt  

(Whole 

Filly  

(Half 

Filly  

(Whole 

Filly  

(Half 

Filly  

(Whole 

Colt  .... 
siblings) 

Colt  .... 
siblings) 

Filly            .  . 
siblings) 

Filly  .... 
siblings) 

Colt  .... 
siblings) 

Colt  .... 
siblings) 

1 -239530-^ 
l-27688cr^ 
1-396190-x 
1 -346840-^ 
1-334790-^ 

l-41501(r^ 

1-239530-j, 
l-27688(Ty 
1-396190-,, 
l-34684o-y 
l-28229a-y 
l-35207o-y 

±-0153 
±-0148 
±-0135 
±•0140 
±  ^0202 
±•0189 

±•0153 
±•0148 
±•0135 
±-0140 
±  -0208 
±-0198 

To  explain  the  last  double  column  I  note  that  Mr.  Sheppard  has  shown  ('  Phil. 
Trans.;  A,  vol.  192,  p.  134)  that  the  probable  error  of  the  median  equals 

•84535  cr/-v/N. 
Hence  in  terms  of  the  bay  range  we  have 


probable  error  of  median       ^         ,,       — , 
length  of  bay  range  "  =  •84535/(^.,/N). 


I  have  found  that  this  simple  result  gives  a  value  close  to  but  slightly  larger  than 
the  probable  error  of  the  quantity  77  (p.  82),  from  which  I  have  determined  the 
position  of  the  mean  in  the  bay  range.  It  is  much  easier  to  calculate,  but  of  course 
not  so  exact,  as  we  take  no  account  of  possible  errors  in  the  bay  range  itself  or 
their  correlation  with  errors  in  the  median.  I  have  accordingly  tabulated  its  values 
m  the  last  double  column  as  a  rough  guide  to  the  errors  made  in  the  numbers 
upon  which  the  statements  in  the  previous  section  depend.  I  shall  return  to  the 
consideration  of  the  probable  errors  below.  Turning  to  columns  3  and  4  of  our 
Table  II.,  we  can  draw  the  following  conclusions  as  to  the  variability  of  sex  and 
class : — 

{a.)  The  Younge?-  Generation  is  more  Variable  than  the  Tlm.s,  foals  are  more 

variable  than  their  sires,  and,  looking  at  the  expressions  in  Table  III.  for  the  bay  range, 
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siies  than  grandsires.  This  is  a  rule  I  have  now  found  true  in  a  very  great  number 
of  cases  of  inheritance.  Parents  are  a  fairly  closely  selected  body  of  the  general 
population,  and  so  less  variable  than  that  jjopulation  at  large.  This  might  appear 
pretty  obvious  in  the  case  of  thoroughbred  horses  when  we  are  dealing  with  sires  and 
grandsires.  I  have  already  pointed  out  that  it  was  impossible  to  take  1000  to  1300 
colts  or  fillies  with  as  many  independent  sires,  the  fashion  in  sires  is  too  marked ;  and 
of  course  the  number  of  independent  grandsires  was  still  fewer.*  But  even  in  the 
case  of  dams,  where  we  have  taken  as  many  independent  dams  as  fillies,  we  see  this 
reduction  in  variability  in  the  older  generation.  As  it  also  occurs  with  stature,  &c., 
in  man  as  well  as  with  coat-colour  in  horses — in  which  latter  case  we  expect  artificial 
selection — it  deserves  special  consideration.  Without  weighting  with  fertility,  there 
exists  a  selection  of  the  individuals  destined  to  be  parents  in  each  generation.  We 
have  to  ask  whether  the  change  in  mean  and  variability  from  parent  to  offspring  in 
each  generation  is  secular  or  periodic,  or  to  what  extent  it  is  partly  one  and  partly 
the  other.  The  importance  of  settling  this  point  is  very  great ;  it  concerns  the 
stability  of  races.  So  far  as  my  fairly  numerous  series  of  measurements  yet  go, 
I  cannot  say  that  a  "  stable  population  "  has  definitely  shown  itself;  the  characters  of 
each  race  when  measured  for  two  generations  seem  to  vary  both  in  mean  and 
standard  deviation.  Palaeontologists  tell  us  of  species  that  have  remained  stable  for 
thousands  of  years,  but  this  is  a  judgment  hitherto  based  on  a  qualitative  apprecia- 
tion, A  quantitative  comparison  of  the  means,  variabilities,  and  correlations  of  some 
living  species  in  its  present  and  its  fossil  representatives  would  be  of  the  greatest 
interest  and  value.  For  myself,  I  must  confess  that  my  numerical  investigations  so 
far  tend  to  impress  me  with  the  unstable  character  of  most  populations. 

(6.)  There  is  fairly  good  evidence  that  the  horse  is  more  variable  than  the  mare  in 
coat-colour.  It  would  be  idle  to  argue  from  the  first  four  results  of  Table  III.  that 
the  mare  is  more  variable  than  the  horse,  in  that  these  results  show  the  dam  to  be 
more  variable  than  the  sire.  For,  as  we  have  shown,  the  process  of  breeding  and  our 
method  of  extracting  the  data  tend  to  produce  a  much  more  intense  selection  of  sires 
than  of  dams.  But  if  we  compare  the  mean  bay  range  in  terms  of  the  standard 
deviation  of  colts  for  our  seven  series  of  colts  with  that  for  the  seven  series  of  fillies 
in  Table  III.,  we  find  for  the  first  1  "27458  and  for  the  second  1 '33854  ay.  Hence 
we  are  justified  in  concluding  that  (Xc  is  greater  than  cry.  In  fact  in  only  one  case  out 
of  the  seven  does  the  series  of  fillies  give  a  less  variability  than  the  corresponding 
series  of  colts,  i.e.,  colts  corresponding  to  dams  are  somewhat  less  variable  than  fillies 
corresponding  to  dams.  It  must,  however,  be  remembered  that  this  conclusion  is 
based  upon  the  coat-colour  of  the  animals  recorded  as  yearling  foals,  t    Thus,  the  coat- 

*  For  some  account  of  the  extent  of  in  and  in  breeding  in  the  thoroughbred  horse,  see  my  memoir  on 
"  Reproductive  Selection,"  '  Phil.  Trans.,'  A,  vol.  192,  p.  257  d  seq. 

t  The  reader  must  always  bear  in  mind  that  when  we  speak  of  the  variability  of  colour  in  sire  or 
dam,  t^c,  it  means  the  variability  of  this  class  when  they  were  yearlings. 
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colour  may  change  both  in  intensity  and  variabihty  with  age,  much  as  variabihty  in 
stature  changes  with  children  from  birth  to  adult  life. 

(c.)  As  a  more  or  less  natural  result  of  {())  it  follows  that  any  group,  male  or 
female,  having  male  relatives  is  more  variable  than  the  same  group  ivith  female 
relatives.  Thus  sires  of  colts  are  more  variable  than  sires  of  fillies  ;  fillies  half-sisters 
to  colts  are  more  variable  than  fillies  half-sisters  to  fillies,  &c.  But  out  of  the  nine 
cases  provided  by  our  data  there  are  three  exceptions  to  the  rule,  and  perhaps  not 
much  stress  can  be  laid  on  it,  at  any  rate  in  the  above  form.  It  would  appear  that 
males,  relatives  of  males,  are  sensibly  more  variable  than  males  relatives  of  females. 
The  bay  ranges  are  1-3926  o-' and  1*4447  o-  respectively,  which  indicates  that  the 
average  a-'  is  larger  than  cr.  But  if  we  treat  the  groups  of  females  alone,  we  find  for 
females  with  male  relatives  the  bay  range  =  1-3694  cr,  and  for  females  with  female 
relatives  1-3433  cr',  which  indicates  that  the  latter  are  more  variable.  The  difference 
is,  however,  not  very  sensible.  Possibly  the  rule  is  simply  that  extremes  tend  to 
produce  their  own  sex,  but  our  data  are  not  sufiicient  for  really  definite  conclusions  on 
the  point. 

In  order  that  we  may  have  a  fair  appreciation  of  the  probable  errors  of  the 
quantities  involved  and  the  weight  that  is  to  be  laid  upon  their  differences,  I  place 
here  a  table"^  of  the  probable  errors  of  t^,  of  ^  =  <r.r/cr^  and  of  r^^y  for  typical  cases. 


IV. — Table  of  Probable  Errors. 


Relations. 

Vx 

i 

u 

Sire  and  Filly   .  . 

•0143 

•0170 

•0243 

•0363 

•0330 

•0288 

Grandsire  and  Colt 

•0143 

•0199 

•0237 

•0385 

•0319 

•0333 

Colt  and  Colt    .  . 

•0186 

•0186 

•0328 

•0328 

•0259 

(Whole  siblings) 

Filly  and  Colt  .  . 

•0179 

•0185 

•0315 

•0335 

•0328 

•0363 

(Half  siblings) 

It  will  be  seen  from  this  table  that  the  probable  error  in  t]  is  about  3  per  cent.,  in 
C  about  2  to  4  per  cent.,  in  u  about  2  to  2-5  per  cent.,  and  the  values  of  r  about  '03, 
growing  somewhat  larger  as  r  grows  smaller.  The  probable  errors  are  thus  some- 
what larger  than  those  which  we  obtain  by  the  old  processes  when  the  characters  are 
capable  of  quantitative  measurement,  but  they  are  not  so  large  as  to  seriously  aflfect 
the  use  of  the  new  processes  in  biological  investigations.  As  we  have  already 
indicated,  the  probable  errors  of  the  t^'s  may  be  roughly  judged  by  Mr.  Sheppard's 
formula  for  the  median  (p.  95). 

It  will  be  seen  that  the  differences  in  the  rj's  and  C's  of  Table  II.,  or  the  u's  of 

*  I  have  to  thank  Mr.  W.  R.  Macdonell  for  friendly  aid  in  the  somewhat  laborious  arithmetic 
involved  in  calculating  these  probable  eirors. 
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Table  III.,  are  as  a  rule  larger  than  the  probable  errors  of  the  differences,  sometimes 
several  times  larger.  Yet  in  some  cases  they  are  not  such  large  multiples  of  the 
probable  errors  of  the  differences  that  we  can  afford  to  lay  great  stress  on  the 
divergence  of  rj  or  {  or  u  in  a  pair  of  special  cases.  We  must  lay  weight  rather  on 
the  general  tendency  of  the  tables  when  all  the  series  are  taken  together.  Thus, 
while  we  may  have  small  doubt  about  the  correctness  of  (i.)  of  §  7  or  (b)  of  §  8,  we 
should  look  upon  (iv.)  of  §  7  as  an  important  suggestion  which  deserves  serious 
consideration  rather  than  a  demonstrated  law.  The  same  again  holds  good  for  (e)  of 
§8.  It  is  because  of  their  suggestiveness  that  they  are  here  included.  That  a 
differential  fertility  or  an  individualisation  in  the  sex  of  offspring  should  be  corre- 
lated with  colour,  would,  if  proved,  be  a  fact  of  very  considerable  interest.  It  would 
again  emphasise  the  important  part  which  genetic  selection  plays  in  the  modification  of 
characters.*  A  jpriori  it  is  not  more  unreasonable  to  expect  coat-colour  in  horses  than 
to  suppose  hair-colour  in  men  to  be  correlated  with  fertility.  But  the  fertility  of  man 
does  seem  to  vary  from  the  light  to  the  dark  races.  The  special  feature  of  interest 
here,  however,  is  that  a  different  colour  in  the  two  sexes  appears  to  influence  the 
preponderance  of  one  or  other  sex  in  the  offspring.  It  would  be  an  interesting 
inquiry  to  determine  whether  the  sex-ratio  in  the  offspring  of  "  mixed  marriages " 
varies  when  the  races  of  the  two  parents  are  interchanged. 

(9.)  On  the  Inheritance  of  Coat-colour  in  Thoroughbred  Horses.— (a.)  Direct  Line. 
First  Degree.— Raying  regard  to  the  probable  errors — about  -03 — in  the  values  of 
the  correlation  coefficient  iv^,  it  seems  quite  reasonable  to  suppose  that  the  mean 
parental  correlation,  actually  -5216,  is  practically  '5.  It  is  quite  impossible  to 
imagine  it  the  '3  of  Mr.  Galton's  view  of  the  Law  of  Ancestral  Heredity.  If  we 
adopt  the  view  of  that  law  given  in  my  paper  on  the  Law  of  Ancestral  Heredity, t 
and  take  the  coefficient  y  to  be  different  from  unity,  then  it  is  shown  in  my  paper  on 
the  Law  of  Eeversion^  that  we  cannot  on  the  theory  of  blended  inheritance  get 
parental  correlation  as  high  as  '5  without  values  of  the  fraternal  correlation  which 
are  much  higher  than  those  hitherto  observed,  certainly  much  higher  than,  as  we 
shall  see  later,  we  find  in  the  case  of  coat-colour  in  horses.  Coat-colour  in  horses  does 
not  thus  appear  to  be  at  all  in  accord  with  Mr.  Galton's  view  of  ancestral  inheritance, 
or  even  with  my  generalised  form  of  his  theory.  It  does  accord  very  well  with  what 
we  might  expect  on  the  theory  of  exclusive  inheritance  as  developed  above,  p.  91, 
on  the  assumption  that  there  is  no  reversion. 

Looking  at  the  matter  from  the  relative  standpoint,  we  see  that  not  much  stress  can 
be  laid  on  the  respective  influences  of  the  sire  and  dam  on  the  colt,  or  of  the  sire  and 
dam  on  the  filly  ;  but,  on  the  other  hand,  the  filly  does  appear  to  inherit  more  from 

*  See  the  concluding  remarks  in  the  memoir  on  "  Genetic  (Reproductive)  Selection,"  '  Phil.  Trans.,' 
A,  vol.  192,  pp.  257—330. 

t  '  Roy.  Soc.  Proc.,'  vol.  62,  p.  386  et  seq. 
I  'Roy.  Soc.  Proc.,'  vol.  66,  p.  140  rt  seq. 
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both  parents  than  the  colt  does.  There  is  certainly  (judged  from  coat-colour)  no 
preponderance  of  the  sire's  influence  over  the  dam's  such  as  breeders  appear  occasion- 
ally to  imagine.  The  average  influence  of  the  dam  on  the  offspring  indeed  appears 
to  be  slightly  greater  than  that  of  the  sire,  but  the  difference  is  of  the  order  of  the 
probable  error,  and  not  of  the  overwhelming  character  exhibited  in  the  case  of  Basset 
Hounds.  There  is  indeed  in  the  case  of  thoroughbred  horses  not  the  same  chance  of 
carelessness  produced  by  a  misalliance  afterwards  screened  by  the  defaulter.  There 
exists,  however,  a  far  greater  premium— considering  the  great  value  of  yearlings  from 
fashionable  sires— set  upon  dishonesty.  Again  it  is  possible  that  when  stallions 
receive  too  many  public  or  private  mares,  or  are  still  used  in  their  old  age,  that  they 
may,  without  losing  the  power  of  fertilising,  lose  some  of  the  power  of  transmitting 
their  characters.  The  divergences,  so  far  as  the  probable  errors  are  concerned,  are 
not  such  that  we  are  forced  out  of  our  way  to  explain  them.  With  the  single 
exception  of  sire  and  colt  we  see  that  our  table  shows  the  universal  prevalence  of  the 
rule  that : 

Relatives  of  the  same  sex  are  more  closely  correlated  than  relatives  of  the  same 
grades  of  the  opposite  sex.    Thus  : — 

A  colt  is  more  like  his  sire  than  his  dam. 

A  filly  is  more  like  her  dam  than  her  sire. 

A  dam  is  more  like  her  filly  than  her  colt. 

A  grandsire  is  more  like  his  grand-colt  than  his  grand-filly. 

A  colt  is  more  like  his  brother  colt  than  his  sister  filly. 

A  filly  is  more  like  her  sister  filly  than  her  brother  colt. 

the  latter  two  cases  being  true  for  both  whole  and  half  siblings. 

The  solitary  exception  is  that  a  sire  is  more  like  his  filly  than  his  colt. 

If  we  were  to  assume  it  a  rule  that  a  filly  in  the  matter  of  coat-colour  has  stronger 
inheritance  all  round  than  a  colt,  we  should  find  it  agree  with  our  results  for  parental 
inheritance,  and  receive  considerable  support  for  the  much  stronger  correlation  of 
fillies  than  of  colts,  when  either  whole  or  half  siblings.  But  it  would  not  be  in 
accordance  with  our  results  for  grandparents,  for  which,  however,  we  have  only 
details  for  two  out  of  the  eight  possible  cases.  On  the  whole,  I  think  we  must 
content  ourselves  with  the  statements  that  parental  correlation  is  certainly  about  '5, 
and  that  with  high  probability  each  sex  is  more  closely  correlated  with  its  own  sex  of 
the  same  grade  of  relationship. 

(6.)  Direct  Line,  Second  Degree. — My  data  here  are  unfortunately  only  for  two 
cases  out  of  the  possible  eight.  I  hope  some  day  to  finish  the  series,  but  the  labour 
of  ascertaining  from  the  studbooks  the  coat-colour  of  700  or  800  separate  sires  is 
very  great.  Indeed  it  is  not  easy  to  follow  up  the  pedigree  through  the  male  line 
when  the  sire  is  not  one  of  the  famous  few.  On  the  other  hand,  it  is  much  easier 
through  the  female  line.    For  this  reason  the  maternal  grandsire  was  taken,  Even 
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in  this  case  we  had  to  seek  back  for  each  sire — the  year  of  whose  birth  was  unknown 
— until  we  found  the  record  of  his  coat-colour  given  under  the  heading  of  his  dam  in 
the  year  of  his  birth. 

The  average  of  our  two  cases  gives  a  coefficient  of  correlation  =  -3353,  the  colt 
having  a  greater  degree  of  resemblance  to  the  grandsire  than  the  filly.  This  value  is 
substantially  greater  than  the  "25  we  might  expect  for  exclusive  inheritance,  and 
more  than  double  the  value  "15,  to  be  expected  for  the  grandparental  correlation 
with  Mr.  Galton's  unmodified  law  for  blended  inheritance.  Of  course  the  '25  is  to  be 
expected  as  the  mean  of  the  eight  grandparental  series,  and,  as  we  shall  see  for  eye- 
colour  in  man,  these  may  vary  very  much  in  magnitude.  But  allowing  for  this,  it 
seems  quite  impossible  that  the  average  value  could  be  reduced  to  -15.  I  take  it 
therefore  that  the  grandparental,  like  the  parental,  data  point  to  a  law  of  inheritance 
other  than  that  which  has  been  described  in  my  paper  on  the  Law  of  Ancestral 
Heredity.  This  peculiar  strengthening  of  the  grandparental  heritage  has  already 
been  noted  by  me  in  my  paper  on  the  Law  of  Keversion,*  and  the  difficulties  of 
dealing  with  it  on  the  principle  of  reversion  therein  discussed.  There  may  be  some 
opinion  among  breeders  as  to  the  desirability  of  emphasising  the  dam's  strain  in  the 
choice  of  a  sire  which  leads  to  this  result,  but  if  so  it  is  unknown  to  me,  nor  do  I  see 
how  it  would  work  without  also  emphasising  the  correlation  of  the  dam  and  foal. 
The  mean  value  of  the  correlation  for  the  maternal  grandfather  and  grandchildren  for 
eye-colour  in  man  is  •3343 — a  result  in  capital  agreement  with  that  for  coat-colour  in 
horses.  In  that  case  the  average  of  the  eight  series,  as  we  shall  see  later,  is  con- 
siderably above  '25,  and  we  must,  I  think,  suspend  our  judgment  as  to  whether  this 
could  possibly  in  the  case  of  horses  be  the  true  mean  value.  As  to  the  value  "15 
it  seems  quite  out  of  the  question. 

As  already  remarked,  the  influence  of  the  maternal  grandsire  (unlike  that  of  the 
sire)  is  substantially  greater  on  the  colt  than  on  the  filly. 

(c.)  Collateral  Heredity,  F'irst  Degree.— Here  we  have  more  ample  data  to  go 
upon,  namely,  a  complete  set  of  six  tables  of  both  whole  and  half  siblings  of  both 
sexes. 

We  notice  one  or  two  remarkable  features  straight  off".  In  the  first  place,  in  the 
case  of  both  fillies  and  colts,  the  whole  siblings  of  the  same  sex  have  not  a  correlation 
the  double  of  that  of  the  half  siblings,  but  have  a  correlation  very  considerably  less 
than  this,  A  priori  we  might  very  reasonably  expect  the  one  to  be  the  double  of 
the  other.  This  is  what  would  happen  in  the  case  of  blended  inheritance  on  the 
hypothesis  of  equipotency  of  the  parents.  As  the  half  siblings  are  on  the  dam's  side, 
we  might  assert  a  considerable  prepotency  of  the  dam  over  the  sire.  This  cannot 
indeed  be  the  explanation  of  the  divergence  in  the  case  of  Basset  Hounds,  where  the 
half  siblings  have  a  correlation  considerably  less  than  half  that  of  whole  siblings,! 

*  'Roy.  Soc.  Proc.,'  vol.  66,  p.  140  et  seq. 
t  '  Roy.  Soc.  Proc.,'  vol.  66,  p.  140  et  seq. 
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and  yet  the  prepotency  of  the  dam  in  coat-colour  is  very  marked.  But  in  the 
present  case  there  is  on  the  average  only  a  slight,  if  indeed  it  be  a  real,  prepotency  of 
the  dam.  Further,  if  we  turn  to  the  correlation,  no  longer  of  sibhngs  of  the  same 
sex,  but  of  opposite  sexes,  we  find  the  correlation  of  the  whole  siblings  is  approxi- 
mately double  that  of  the  half  siblings,  as  we  should  d  priori  have  expected. 

Taking  averages  on  the  assumption  that  the  correlation  for  whole  siblings  should 
be  double  that  for  half  siblings,  we  have  the  following  results 

Correlation  between  colts  based  on  results  for  whole  and  half  siblings  .  '6667 

Correlation  between  fillies  based  on  results  for  whole  and  half  siblings  .  7729 
Correlation  between  filly  and  colt  based  on  results  for  whole  and  half 

siblings   "5747 

Mean  correlation  of  siblings  based  upon  all  results  for  whole  siblings   .  "6329 

Mean  correlation  of  siblings  based  upon  all  results  for  half  siblings  .  .  '7100 
Mean  correlation  of  siblings  based  ujDon  results  for  both  whole  and 

half  siblings   "6714 

We  can  draw  the  following  conclusions  : — 

(i.)  In  whatever  manner  we  deduce  the  fraternal  correlation  it  is  very  much  larger 
than  the  '4  for  whole  brethren,  or  the  '2  for  half  brethren,  required  by  the  unmodified 
Galtonian  law.  Such  values,  as  we  see  above,  could  be  deduced  from  the  modified 
Galtonian  law  by  taking  y  greater  than  unity,"'^  but  this  would  involve  values  for 
the  parental  correlation  sensibly  less  than  those  given  by  theory.  We  are  again 
compelled  to  assert  that  the  modified  as  well  as  the  unmodified  theory  of  blended 
inheritance,  based  on  the  Galtonian  law,  does  not  fit  the  facts.  The  above  values, 
however,  are  quite  compatible  with  the  theory  of  exclusive  inheritance  on  the 
assumption  that  there  is  an  individual  (not  a  sexual)  prepotency  from  one  pairing  to 
another. 

(ii.)  In  whatever  way  we  consider  it,  it  would  appear  that  the  average  value  of 
the  fraternal  correlation,  as  deduced  from  siblings  with  the  same  dam  only,  is 
greater  than  that  deduced  from  siblings  with  both  the  same  dam  and  the  same  sire. 

I  am  not  able  to  explain  this  in  any  way,  for  I  cannot  assert  a  very  substantial 
prepotency  of  the  dam.  All  I  can  say  from  the  data  at  present  available  is  that  for 
horses  and  dogs  there  appears  to  be  no  simple  numerical  relation  between  the  correla- 
tion of  whole  and  half  brethren, 

(iii.)  Offspring  of  the  same  sex  are  more  alike  than  offspring  of  opposite  sexes. 

This  appears  to  be  generally  true,  so  far  as  our  data  at  present  extend,  and  will  be 
fairly  manifest  from  the  table  below. 


*  'Eoy.  Soc.  Proc.,'  vol.  66,  p.  140  c/"  seq^. 
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Table  V. — Collateral  Heredity. 


Man. 

Dog. 

Horse. 

i 

i 

Coat-colour. 

Pair. 

Stature.* 

Cephalic 
Index,  t 

Eye-colour.  | 

Coat-colour.§ 

i 

Whole 
Siblings. 

Half 
Siblings. 

Brother-Brother . 
Sister-Sister  .  . 
Brother-Sister 

•3913 
•4436 
•3754 

,  -3790 
•4890 
•3400 

•5169 
•4463 
•4615 

||    -5257  1 

•6232 
•6928 
•5827 

•3551 
•4265 
•2834 

It  will  be  noted  that,  with  the  single  exception  of  eye-colour  in  man,  sister  and 
sister  are  more  alike  than  brother  and  brother. 

The  mean  value  of  the  fraternal  correlation  for  stature  is  "4034,  and  for  cej)halic 
index  "4027.  These  results  are  in  excellent  accordance  with  the  "4  required  by  the 
Galtonian  theory  of  blended  inheritance.  The  mean  values  for  eye-colour  in  man, 
coat-colour  in  dogs,  and  coat-colour  in  horses  are  :  "4749,  "5257,  and  •6329.  These 
are  quite  incompatible  with  that  theory.  I  venture  accordingly  to  suggest  that  we 
have  here  cases  of  an  inheritance  which  does  not  blend,  and  that  it  is  an  inheritance 
which  is  far  more  closely  described  by  the  numbers  we  have  obtained  on  the  theory 
before  developed  of  exclusive  inheritance  than  by  the  law  of  ancestral  heredity. 

Taking  in  conjunction  with  these  results  for  collateral  heredity,  those  for  parental 
and  grandparental  inheritance,  we  see  that  coat-colour  in  horses  diverges  widely  from 
the  laws  which  have  been  found  sufficient  in  the  cases  of  stature  and  cephalic  index 
in  man.  The  latter  characters  are  really  based  on  a  complex  system  of  parts,  while 
the  determination  of  coat-colour  may  depend  on  a  simple  or  even  single  factor  in  the 
plasmic  mechanism.  Here  Mr.  Galton's  suggestion  of  an  exclusive  inheritance  of 
separate  parts  ('Natural  Inheritance,' p.  139)  may  enable  us  to  understand  why 
stature  and  cephalic  index  differ  so  widely  in  their  laws  of  inheritance  from  coat-  and 
eye-colours. 

Part  III. — On  the  Inheritance  of  Eye-Colour  in  Man. 

(10.)  On  the  Extraction  and  Reduction  of  the  Data. — The  eye-colour  data  used  in 
this  memoir  were  most  generously  placed  at  my  disposal  by  Mr.  Francis  Galton. 
They  are  contained  in  a  manuscript  wherein,  by  a  simple  notation,  we  can  see  at  a 

*  Pearson,  '  Phil.  Trans.,'  A,  vol.  187,  p.  253  et  seq.    See  Note  I.  at  the  end  of  this  paper. 

t  Fawcett  and  Pearson,  '  Roy.  Soc.  Proc.,'  vol.  62,  p.  413  et  seq. 

t  Present  memoir,  p.  1 1 3  seq. 

§  Pearson,  'Roy.  Soc.  Proc.,'  vol.  66,  p.  140  et  seq. 
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glance  the  distribution  in  eye-colour  of  a  whole  family  in  its  numerous  male  and 
female  lines.  Such  complete  details  of  the  various  direct  and  collateral  relationships 
I  have  not  hitherto  come  across,  and  from  them  I  was  able  to  form,  in  the  course  of 
some  months  of  work,  the  twenty-four  tables  of  correlation  which  are  given  in 
Appendix  II.  These  tables,  for  the  first  time,  give  the  whole  eight  series  of  grand - 
parental  and  the  whole  eight  series  of  avuncular  relationships,  besides  such  as  we  have 
deduced  for  other  characters  previously,  i.e.,  the  four  parental,  the  three  fraternal  rela- 
tionships, and  the  table  for  assortative  mating.  The  very  great  importance  of  this 
material  will  at  once  be  obvious,  and  I  cannot  sufficiently  express  my  gratitude  to 
Mr.  Galton  for  allowing  me  to  make  free  use  of  his  valuable  data. 

At  the  same  time  we  must  pay  due  regard  to  the  limitations  of  this  material,  which 
it  is  needful  to  enumerate,  so  that  too  great  stress  may  not  be  laid  on  the  irregularities 
and  divergences  which  arise  when  we  attempt  to  reduce  the  results  to  laws.  These 
limitations  are  as  follows  : — 

(«.)  While  the  data  of  about  780 'marriages  are  given  in  the  record,  they  belong- 
to  less  than  150  separate  families.  All  our  relationships,  therefore,  contain  pairs 
weighted  with  the  fertility  of  the  individual  families.  Thus  it  was  necessary  to  enter 
every  child  of  a  mother,  every  nephew  of  an  uncle,  and  so  forth.  In  the  horse  data 
we  could  take  1000  distinct  mares  and  give  to  each  one  foal  only.  That  is  not  possible 
in  the  present  case. 

(6.)  The  colour  of  eyes  alters  considerably  with  age.  It  is  not  clear  that  some  of 
the  eye-colours  are  not  given  for  infants  under  twelve  months,  and  certainly  the  eye- 
colours  in  the  case  of  grandparents  and  others  must  have  been  taken  in  old,  perhaps 
extreme  old,  age.  In  a  large  number  of  other  cases  of  great  grandfather,  great  great- 
grandfather, &c.,  great  uncles,  and  so  forth,  the  eye-colours  must  have  been  given 
from  memory  or  taken  from  portraits — in  neither  alternative  very  trustworthy  sources. 
Thus  while  the  horse  colour  is  always  given  for  the  yearling  foal  by  the  breeder,  the 
eye-colour  is  given  at  very  different  ages,  and  comes  through  a  variety  of  channels. 

(c.)  The  personal  equation  in  the  statement  of  eye-colour,  when  the  scale  contains 
only  a  list  of  tint-names  is,  I  think,  very  considerable.  The  issue  for  the  collection  of 
data  of  a  plate  of  eye-colours  like  that  of  Bertrand  would  be  helpful,  but  we  can 
hardly  hope  for  a  collection  of  family  eye-colours  so  comprehensive  as  Mr.  Galton's 
to  be  again  made  for  a  long  time  to  come. 

These  causes  seem  to  me  to  account  for  a  good  deal  of  the  irregularity  which  appears 
in  the  numerical  reduction  of  the  results,  but  they  are  not,  I  hold,  sufficient  to  largely 
impair  the  very  great  value  of  Mr.  Galton's  material. 

In  tabulating  the  data,  I  have  followed  the  scale  of  tints  adopted  by  Mr.  Galton, 
and  I  have  used  the  entire  material  available  in  the  cases  of  the  grandparental, 
avuncular,  and  marital  relations.  I  nearly  exhausted  the  data  for  the  parental 
relationships,  but  in  these  tables,  which  were  first  prepared,  I  stopped  short  at  1000 
for  the  sake  of  whole  numbers.    1  found,  however,  that  it  did  not  make  the  arithmetic 
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sensibly  shorter,  and  I  afterwards  dropped  this  limitation.  In  the  case  of  brethren 
I  took  1500  of  each  case — I  daresay  I  could  have  got  2000  out  of  the  records.  As 
the  light-eyed  brethren  are  entered  first  in  Mr.  Galton's  MS.,  the  First  Brother 
in  my  unsymmetrical  tables  is  always  lighter-eyed  than  the  Second  Brother,  hence 
the  tables  had  to  be  rendered  symmetrical  by  interchanging  and  adding  rows  and 
columns  before  we  could  reduce  them.  Thus  the  symmetrical  tables  have  an  apparent 
entry  of  3000  pairs.  Of  course  1500  is  the  number  used  in  finding  the  probable 
error  of  the  correlation  coefficient.  The  like  difficulty  does  not  occur  in  the  brother- 
sister  table,  where  indeed  the  difference  of  mean  eye -colour  for  the  two  sexes  would 
not  allow  of  our  making  the  table  symmetrical.  A  comparison  of  the  symmetrical 
with  unsymmetrical  tables  for  colts-colts  and  fillies-fillies,  will  show  how  little  need 
there  is  for  rendering  the  tables  symmetrical  when  pairs  are  taken  out  of  any  similar 
class  and  tabulated  without  regard  to  the  relative  magnitude  of  the  character  in  the 
two  individuals  of  the  pair,  i.e.,  Weatherby's  record  places  the  individuals  simply  in 
order  of  birth  and  not  of  darkness  or  lightness  of  coat-colour. 

Table  VII.  gives  the  value  of  the  chief  numerical  constants  deduced  from  the  twenty- 
four  eye-colour  tables  in  Appendix  II.* 

(11.)  On  the  Mean  Eye-colour  having  regard  to  Sex  and  Generation. — In  order  to 
test  the  degree  of  weight  to  be  given  to  our  conclusions,  I  have  drawn  up  a  table  o^ 
probable  errors  for  four  typical  cases — cases  by  no  means  selected  to  give  the 
lowest  possible  values.  Further,  in  Table  VIII.  I  have  given  the  probable  error  in 
the  position  of  the  median  as  determined  in  terms  of  the  grey,  blue-green  range  by 
the  modification  of  Mr.  Sheppard's  formula  (see  p.  95).  The  grey,  blue-green  range 
of  eye-colour  is  about  one-fifth  of  the  total  observed  range,  so  that  the  probable  error 
in  the  position  of  the  median  varies  from  about  '4  to  1  per  cent,  of  that  range.  This 
is  not  a  large  error,  but,  relative  to  the  small  variations  of  value  with  generation  and 
sex,  it  is  sensible,  and  we  must  nob  draw  too  fine  conclusions  on  the  basis  of  single 
inequalities. 


Table  VI. — Table  of  Probable  Errors  in  Eye-colour  Data. 


Relations. 

Vx 

C 

2t 

Mother  and  Son     .  . 

•0253 

•0188 

•0431 

•0267 

■0256 

•0283 

Maternal  Grandmother 

and  Granddaughter  . 

•0348 

•0350 

•0767 

•0276 

■0314 

•0361 

Sister  and  Sister     .  . 

Maternal   Aunt  and 

•0244 

•0244 

•0216 

■0216 

•0234 

Nephew  

•0230 

•0186 

•0414 

■0255 

■0250 

■0302 

*  The  theoretical  formulae  by  aid  of  which  these  constants  were  determined,  have  been  indicated  in  the 
earlier  part  of  this  memoir,  and  in  Part  VII.  of  the  present  series  on  Evolution.  The  actual  work  of 
reduction  has  been  most  laborious,  l)ut  I  trust  that  our  results  are  free  from  serious  error. 
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If  we  examine  this  table  we  see  that  the  error  in  17  amounts  to  from  '02  to  '025 
when  we  have  upwards  of  1000  tabulated  cases,  but  can  amount  to  -035  when  v^e 
have  as  few  as  700  to  750  tabulated  cases.  An  examination  of  the  values  of  rj  in 
Table  VII.  shows  us  that  most  of  our  differences  with  probable  errors  taken  on  this 
scale  are  very  sensible.  A  comparison  with  Table  VIII.  shows  us  that  the  probable 
error  of  the  median  is  always  greater  than  the  probable  error  of  yj,  and  accordingly 
the  former,  being  much  easier  of  calculation,  may  be  taken  as  a  maximum  Hmit.  The 
probable  errors  of  i.e.,  the  ratio  of  0-,,  to  cr^,  are  more  considerable,  amounting  to  about 
•04  for  our  longer  series,  and  even  to  '077  in  the  case  of  the  short  series  of  grand- 
mother and  granddaughter,  but  in  this  case  C  actually  takes  its  maximum  value 
of  1-291,  so  that  the  error  is  under  6  per  cent.  ;  in  the  longer  series  it  is  under  5  per 
cent.  Again,  we  see  that  in  most  cases  our  differences  in  the  ratio  of  variabilities  are 
quite  sensible.  It  must  be  admitted,  however,  that  the  ratio  of  variabilities  as  based 
on  the  grey  blue-green  range  of  eye-colour  is  not  as  satisfactory  as  that  based  on  the 
bay  range  of  coat-colour  in  horses.  In  the  latter  case,  one-half  of  the  horses  fall  into 
the  bay  range,  but  only  about  a  quarter  of  mankind  fall  into  the  grey  blue-green 
range  of  eye-colour,  and,  further,  the  appreciation  of  eye-colour  seems  to  me  by  no 
means  so  satisfactory  as  that  of  coat-colour  in  horses. 

I  have  tried  a  further  series  of  values  for  the  ratio  of  the  variabilities  by  measuring 
the  ranges  occupied  not  only  by  the  tints  grey  blue-green,  but  by  the  whole  range  of 
tints  3,  4,  5,  and  6  of  Mr.  Galton's  classification  (see  p.  87).  Lastly,  I  have  taken 
a  third  method  of  appreciating  the  relative  variabilities,  namely,  by  using  the  method 
of  column  and  row  excesses,  and  Ej,  discussed  in  Part  VII.  of  this  series.  While 
this  method  has  the  advantage  of  using  all  and  not  part  of  the  observations  to  deter- 
mine the  ratio  of  cr^.  to  o-y,  and  so  naturally  agrees  better  with  the  results  based  on 
the  four  than  the  one  tint  ranges,  it  suffers  from  the  evil  that  these  excesses  can  only 
be  found  by  interpolation  methods,  which  are  not  very  satisfactory  when  our  classes 
are,  as  in  this  case,  so  few  and  so  disproportionate.  On  the  whole,  this  investigation 
of  relative  variability  is  the  least  satisfactory  part  of  our  eye-colour  inquiry,  and  I 
attribute  this  to  two  sources  : — 

(i.)  The  vagueness  in  appreciation  of  eye-colour  when  no  colour  scale  accompanies 
the  directions  for  observation  (cf.  p.  103,  (c)  ). 

(ii.)  A  possible  deviation  from  true  normality  in  the  factor  upon  which  eye-colour 
really  depends  {cf.  p.  80,  (ii)  80), 

Lastly,  we  may  note  that  the  probable  error  in  the  correlation  amounts  in  most 
cases  to  less  than  -03,  rising  only  somewhat  above  this  value  for  grandparental 
inheritance,  where  our  series  are  somewhat  short — 650  to  750  instead  of  1000.  Here 
again  most  of  the  divergences  are  quite  sensible. 

Allowing  accordingly  for  the  comparative  largeness  of  our  probable  errors,  ^ve 
shall  do  best  to  base  conclusions  on  the  general  average  of  series  ;  to  insist  on  general 
inequalities  rather  than  on  exact  cpiantitative  differences,  and  to  eiii])].uisise  the 

VOL.  CXCV. — A.  p 


TOG 


PROFESSOR  K.  PEARSON  AND  DR.  A.  LEE  ON 


o  . 

.1  « 

O  CD 

O  U 

O 


o  o  o  o 
o  o  o  o 
o  o  o  o 


o  o  o 
o  o  o 

»0  iO  lO 


05  CO  o  on 
lo  00  ^  o 

T— I  00  >— I  O 
lO  CO  lO  lO 


CO  IC) 

o  CO  m 

T— I  ^  o 


O  Ci  O  (M 

1-^1 — ^  I— (  CJi 
00  03  lO  •— 1 
lO  ^  ^  lO 


OS  CO  CO 
CO  «5  t— I 
.— I  ^  (M 
lO  ^  ^ 


J.-~ 


lOi-Hi— T— It—  5003 
CDOOt-GO'*!— ilOCO 


<M  ^  lO  05  ^  1— I  CO 
t—  to        CO  ^  00  C-l  CO 

1— ICOOOr— (COr— 1-^ 

^  CO  ^  CO  CO  (M  (M  <M 


ooo<M-*coa3iooo 

O^CM'*CO00'*^lO 
C<lr— l<Mr— li-Hi— Ii-hO 


IOOCO<MOOCOIO-* 

coooco^-^ai<MO 

CDOOCOOOr-Hlr-COCO 

CO  <M  CO  t— I  CO  1— I  (M  (M 


00  I— I  (M        (M  OS  lO 

(MOlCOOOCOOOlT— I 
CO  CO  CM  (M  <M  (M 


'^QOt-Oit-t-OOlO 
(M  O  CO  lO  O  CM  1— I 
00  00t-Oir5<MlO05 
(M  <M  CO  (M  CM  CM  (M  1— < 


o  « 


%^  O 


CO  O  CO 
O  I— 1  05 
lO  CO  CO  o 
lO       '*  >o 


(3i  CO  lO 

CO  CO  r- 1 

1— I  CO 

lO 


CO  (M  t—  05  <M  lO  CO  O 
I— iOt— ICDCMOCMOO 
(MOOt-Olt—  CM>Oi-H 
-^COCOCMCMCMCMCO 


■*-*«D00t-CMCOi— I 
O^OCOCOOSt—O 
CMOOt-CiOOOTlOi— I 
COCqCOi— ICMt— ICMCM 


b 


kO  iCi  O  lO 
00  00  1— I 
CO  00  CO  00 
<J3  00  O  03 


CO  Ol 

lO  I  lO 

CJ3  00 

o  o 


o 


Jt-  CO  CO  00 
CO  lO  CO  00  I 
CD  C~-1  CO  I— I 
O  T-H  05  O 


o 

CO 


o  <b  ^ 

•I 

t>  o)  9< 


(M  <M  W 
CM  CM  CO  CM 
cJ3  CO  03  t— 
lO       o  t- 


^  9  ^ 


00  ^  lO  lO 
T— I  CO  00  Oi 
O  O  ^ 
lO  lO  CO  00 


00  I— I 

CM  lO 
CO  00  O 
CO  00  00 


00  CO 
(M  ^ 

CO  00  to 
OO  ^ 


COOlCOCOCiOCMCO 

ocmcocO'^Ot— 
o^cooicoinoo^i:— 

OlOTCOOi— lOCJOt— 


lO  00  CM  CO  CO  (M  00 
^  <M  I— I  O  05  CM  Ol  CO 
COi— lOOiOOO'— 1<03 
1 — lOOiOSi — icJiOO 


o 

CM 
OS 


CM 
CO 
CO 


CO 
CO 


OO^CMlOCSiOSt—  Ol 

oioocMt-ioinooo 

OCOr-HCOCOCMOOOS 
00030100051— I  (M 


>Ci       CM  O  CM  00  CM 

r— I  J>.  CM  t— I  >— I  i-H 

OOOOl— llOOOOOl— I 
CDOOiOOCJS'-HOlOl 


O^CncMCOcOOiO 
COCOCMlOCO»OlOc35 

1 — li — ^r-HCOt—i — I' — lO 

t^qpt-cx)cpopt-03 


CM  CO  00  CD  CO  Oi  05  (M 
OCOCMi— ICDCOCOOO 
^Ot-00lO»O0005 
COOJt—OOCOt-COt- 


lO00'*>Oi— iCOi — ICO 
CO  03  in  1-H  03  03  t— 
CDCDi— li— I'^^COCO 
^-^CMCOOOOlOJt- 


t-t-OCD<MOOOiO 
OOOOlOlOOOOOt—  1— I 
C0O00lO03CJ5i— ICM 
t-OO-^xtHCOt-OOJ:- 


03 


p-( 


B.  ^  ^ 


u 

5:;  CO  tc 
W  M  c« 


^        ^         ^  pE3 

tJ3      bO  &iO 
oJS  oJ3  oJS  oJ§ 


03         CD         O  (D 
0)  .S   05  .S   05         03  -Jh 


03  03 


03  03 


03 

O  -t^  o 

%t  ^  i-^ 
m pq 


to 

M 


03  03 

^  J3 

-1-3  -1-3 
03  03 

t3 

C  C 

c  c 
u  u 

03  03 
03  03 

PM  Ah 


^-1  t-i 

03  03 

cS  03 

0  s 


c3  03 


03  03 

o  o 


03  03 
O  O 


c3  03 


U  fH 

Oc5 


c3  cS 

a?  fli  3^ 

o3  03  i_|  k_| 
P-l  P-l 


03  a3^^+^+^40-^^ 

11 S  g  §  s  §  § 
f3  f3  U)    ^  <^  <1<! 


MATHEMATICAL  CONTRIBUTIONS  TO  THE  THEORY  OF  EVOLUTION. 


d  .,—1 

i-H  -*  lO  O 

!>.  00  X:^  CO 

CO  CO  CO  CO 

o  o  o  o 

as  o  00 

OS  I-H  OS 
I-H  (M  (M 

o  o  o 

•0409 

i-Ht^i-Ht-^OQOl:^ 
OOCOi-hi-hOCOOO 
CO'^'^"^"^"^'^"^ 

oooooooo 

I— 1  lO  O  <M  CD  t-  OS 

(M  CO  OS  (>1  CO  I-H  r-H  r-H 

cocooicococococo 
oooooooo 

O     f-l  05 

1 

fL|  oj  a 

c•^  T-^  c•^ 

Ol  CO  lO  i~ 

CO  ^  CO  CO 

o  o  o  o 

Ol  O  CO 

C5  I-H 
r-H  (M  CM 

o  o  o 

•0377 

u:jcoco-*OoocD>ci 
oo»ocooioc^oo<r^ 
co^co^coco-*»o 
oooooooo 

CDf—i— iCDOSOl — ^i-H 
CO(MOCOOSlOi-HOS 
(M  CO  CO  CO  (M  CO  CO  (M 
OOOOOOOO 

CO  b 

O  00  o 

I-H  t-  O  O 

1— 1  I— I  oi  lo 

1     >0  CO  ^  CD 

-*  CO 

CO  O  I-H 
CO  I-H  Ci 

lO  CD  lO 

■6203  * 

COCOCOCDCDOi-HiO 
OOsfMCO-^OOlCD 

ot^inoooocoooci 

CDCOlOt^lOt^lOt^ 

OO^COOOOSt^i-HCO 
CDOCOO<MOS£~t~ 
C-T  CO        C^T  OS  O  CO  00 

lO  00  lO  00  lO  00  If; 

Range  of  tii 
3, 4, 5, 6  in  tei 
of  0- .  Range  = 

o  CO  CO  I-H 

lO  I— 1  lO  CD 
05        CD  CJ 
CO  CD  »0 

!>•  C-1 
CO  O  05 
CD  I-H 
lO  CD  CO 

1-7113 

C^l  0"i  I— H  OS  CO       OS  t~— 

0SC0<M05i-Hi-Ht^-* 

Oit-(Mi-HOCDCO-* 
COCOCD'^CDIOCO'^ 

i"0               ("O  1    1  -^J^  r-- 1  ^1 

'-•  J   "^.J                             AT*  tj^J 

COlOfMOSi-Ht^COO 
CO  (M  CO  CO  CO  OS  CO  (M 
'+-^iO-*CO-^J>-l>- 

fO  b 

+3  ^ 
fl  O  II 

CD  CD  (M  CO 
05  CD  CO  1-H 

1— 1  a>  T— 1  CO 

CD  !>.  1:^ 

Ol  l>-  CO 
»C  ^  ^ 
CO  CO 

•7423 

t^oot^Oi-HQot-io 

i-hi-hOCOOSCOC-I^ 
O'^l^WCOCOlOCO 
OOt-t^t^t^t^t-t^ 

lOOOOCDt^-i^i-Ht— 
CDOSCOlOOSCSt^CO 
<MlOi-H00C00000i-H 
I>.t>.00t~l--.1>.1>.00 

Range  of  ti 
in  terms 
a- .  Range  = 

a 

CD  Ci  !>■  00 

^  00  ■— 1 
t-  I— 1  CD  I-H 

CO  CO 

05  t-  O 

IC  I-H 

CO  o 

t~  !>■  00 

•8061 

OS        O  O  (M  lO  <M  fM 
CO  'TtH  (M  OS  00  (M  CO 
OSi — ^r— lOSOOOSCOOS 
t^t~00t~00t^CDlO 

<M  lO        00  00  CD  (M  lO 
^  OS        CD  O  (M  <M  CI 
(M  CD  OS  ^  (M  r-H  O  OS 
OOlr^t^t^oOt^oOQO 

Ratio  of 
variabilities 
from  "excess." 

i 

c 

CTl  lO  1-H  CO 

I— 1  05  -^JH  05 

o  ^  o  >o 
o     o  oi 

00 

l-H  I— 4  O 

o 

•0657 

t^t-^I-H-^lOt^I-H 

00t~'*00CDl:~OQ0 
■^OCO<M£^OOOCD 
OOOOSOSOSOSOO 

lOt^COOSCD<NCDO 
<M  (M  I-H  ^  CO       I-H  (M 
OOinOOOO-*l:^CO 
OSOOOOSOCOOOS 

I-H  r— 1 

1 — 1 

I-H 

1 — ^  r-H  I — 1 

i-H         r-H  I-H 

b"^ 

r— 1  (^^  oi 

00  CO  IC  (M 
05  lO  05 

<33  O  <35  O 
'  Ah 

1>- 

I-H 

r-H  I-H  05 

OS 

•9383 

CDCOt-mfMOSt^OS 
CO(MCDJ>.^i-HOSr-H 
lOOSCOt~(MOi-HlO 
OSOSOSOOi-hOi — I 
1 — ^  I-H  r-H  1-H  I-H 

OS00-*-*-*-*(MO 
t^tMOOlOCOOCOCO 
i-Hi--OSi-HOS00C0Ir- 
Oi— lOSOOs^OsO 

I-H  1 — 1  I — I  I — 1  I — 1 

tives. 

Son  

Daughter  .    .  . 

Son  

Daughter  ... 

Brother.    .    .  . 
Sister  .... 
Sister  .... 

Wife  .... 

Grandson  .    .  . 
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Aunt    .    .  . 
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Father 
Father 
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Brother 
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Brother 
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Paternal 
Paternal 
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Paternal 
Paternal 
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Paternal 
Paternal 
Maternal 
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Paternal 
Maternal 
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general  tendency  of  a  series  rather  than  pick  out  single  differences  for  special 
consideration.  If  we  do  this  we  shall  still  find  that  remarkable  results  flow 
from  our  Tables  VII.  and  VIII.,  most  of  which  seem  hitherto  to  have  escaped 
attention. 

I  return  now  to  the  special  topic  of  the  present  section,  the  mean  eye-colour,  after  this 
lengthy — if  needful — digression  on  the  probable  error  of  the  data  given  in  our  tables. 

We  may,  I  think,  safely  draw  the  following  conclusions  : — 

(a.)  Man  has  a  mean  eye-colour  very  substantially  lighter  than  that  of  ivoman. 

If  we  compare  the  mean  eye-colour  of  father  Avith  mother,  of  son  with  daughter, 
of  brother  with  sister,  of  grandfather  with  mother,  of  uncle  with  aunt,  of  grandson 
with  granddaughter,  of  nephew  with  niece,  we  have  the  same  result— man  is  distinctly 
lighter  eyed  than  woman. 

{h.)  There  appears  to  he  a  secular  change  taking  place  in  eye-colour,  hut  this  is 
more  marked  and  definite  in  the  man  than  in  the  ivoman. 

Thus  we  have  the  following  mean  values  for  77  in  classes,  which  must  roughly 
represent  successive  generations  : — 

Grandfather.    .    .    '3658  "]  .^^^g       Grandmother    .    .    '8757  -8523 


Father     ....     -5241  J  Mother    ....  -8290 

Son  -5929  \  .g^g^       Daughter.    .    .    .     7524  \  .g^^^ 

Grandson.    .    .    .     '7039  J  Granddaughter.    .     -8508  J 

Another  comparison  may  be  made  by  noting  that  grandsons  are  darker  than 
grandfathers,  sons  than  fathers,  nephews  than  uncles.  Similarly,  granddaughters  are 
lighter  than  grandmothers,  daughters  than  mothers,  but  nieces  are  not  lighter  than 
aunts,  as  we  might  have  expected.  Thus,  while  the  records  show  a  definite  darkening 
of  the  eyes  of  men,  there  appears  to  be  a  certain  but  less  sensible  lightening  of  the 
eyes  of  women.  Again,  the  younger  generations  are  much  closer  in  eye-colour  than 
the  older  generations.  The  difference  in  eye-colour  between  grandsons  and  grand- 
daughters, sons  and  daughters,  nephews  and  nieces  is  only  about  15  per  cent,  of  the 
grey  blue-green  range,  but  for  fathers  and  mothers  it  is  30  per  cent.,  and  for  grand- 
fathers and  grandmothers  50  per  cent. 

When  we  realise  that  difference  in  eye-colour  appears  to  be  a  sexual  character,  the 
true  explanation  of  this  secular  change  in  eye-colour  becomes  still  more  obscure. 

If  the  lighter  eye-colour  of  the  older  generation  be  due  to  an  effect  of  old  age,  why 
is  it  conspicuous  only  in  the  male  and  not  in  the  female  ?  Why  is  the  mother  sensibly 
darker  than  the  daughter,  but  the  father  sensibly  lighter  than  the  son  ? 

Further,  sup230sing  light  eyes  much  commoner  among  our  grandfathers  than  among 
their  grandsons,  and  dark  eyes  among  our  grandmothers  than  among  their  grand- 
daughters, we  cannot  attribute  the  great  approach  in  eye-colour  to  a  blending  of  the 
parental  characters,  for,  as  we  shall  see  later,  eye-colour  does  not  seem  to  blend,  it  is 
rather  an  exclusive  character.    We  should,  therefore,  be  thrown  back  on  prepotency 
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of  the  mother — a  conclusion  possibly  warranted  by  our  results  in  the  case  of 
daughters,  but  not  in  the  case  of  sons.  Again,  why  was  there  such  a  marked 
difference  in  eye-colour  between  the  men  and  women  of  three  or  four  generations 
back  And  if  it  was  a  sexual  character,  why  is  it  disappearing  ?  Was  it  not, 
perhaps,  a  racial  difference  ?  Light  and  dark  eyes  are  not  unusually  associated  with 
distinct  races,  and  it  is  just  possible  that  the  change  in  eye-colour  is  a  product  of 
reproductive  selection  ;  the  old  blue-eyed  element  of  the  population  may  be  dwindling 
owing  to  the  greater  fertility  of  the  women  of  dark-eyed  race,  and  thus  without  any 
obvious  struggle  for  existence  and  survival  of  the  fitter,  the  blue-eyed  race  may  be 
disappearing  from  England,  as  the  Langobard  type  has  so  largely  gone  from  Italy  and 
the  Frank  from  France,  t  It  will  not  do  to  be  dogmatic  about  these  matters,  but  the 
more  one  measures  characters  in  different  generations,  the  less  stable  do  races  appear 
to  be.  We  speak  of  the  national  characters  of  the  Englishman  or  the  Frenchman 
based  upon  our  experience  of  how  these  races  have  acted  in  past  history,  but 
although  there  has  been  no  great  racial  invasion  nor  struggle,  can  we  safely  assert 
the  physical  characters  of  the  Englishman  to-day  do  not  differ  substantially  from 
those  of  the  Englishman  of  the  Commonwealth  ?  It  seems  to  me  that  the  possibly 
continuous  change  of  characters  in  a  race,  not  subjected  to  very  apparent  internal  or 
external  struggle,  is  a  problem  of  the  highest  interest  to  the  anthropologist  and 
ultimately  to  the  statesman. 

Whatever  be  the  explanation  of  this  secular  change  in  eye-colour,  it  appears  to 
correspond  singularly  enough  to  the  secular  change  we  have  noted  in  the  coat-colour 
of  thoroughbred  horses — in  the  older  generation  the  sexes  differ  more  widely  than  in 
the  younger. 

(c.)  The  maternal  male  relative  (grandfather  and  uncle)  is  substantially  lighter-eyed 
than  the  paternal  male  relative  {grandfather  and  uncle). — I  see  no  explanation  for  this 
curious  result,  but  it  seems  worth  while  to  specially  note  it,  for  there  are  curious 
anomalies  in  the  inheritance  through  the  various  male  and  female  lines  which  may 
find  their  complete  explanation  some  day  when  more  and  possibly  more  trustworthy 
characters  have  been  investigated. 

(12.)  On  the  Variability  of  Eye-colour  loith  reference  to  Sex  and  Class. — The 
determination  of  the  relative  variability  of  not  exactly  measurable  characters  is,  as 
we  have  already  seen  (p.  105),  a  somewhat  delicate  problem.  It  is  more  so  in  the  case 
of  eye-colour  in  man  than  of  coat-colour  in  horses,  for  there  is  greater  difference  in 
the  means,  and  accordingly  the  ratio  of  o-.r/cr^,  as  found  from  the  ratio  of  the 
"  excesses"  (p.  105),  will  be  even  less  reliable.^    The  class  indices  corresponding  to  the 

*  Mr.  Galton's  records  went  back  to  great-grandfathers,  many  of  whom  accordingly  appear  in  our 
data  for  grandfathers. 

t  See  Note  II.  at  the  end  of  this  paper. 

\  The  relative  variability  of  all  classes  was  worked  out  for  thorough-l)red  horses  by  the  "  excess  " 
method,  and  in  only  one  case — that  of  dam  and  colt — did  it  differ  from  the  bay  range  method  in  its 
determination  of  the  class  with  the  greater  varialjility. 
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grey  blue-green  range  are  also  not  entirely  satisfactory  in  their  results,  nor  those 
taken  for  a  still  larger  range  covering  tints  3,  4,  5,  -and  6,  or  blue-green,  grey,  hazel, 
light  brown,  and  brown,  which  cover  roughly  about  1-5  to  1-6  times  the  standard 
deviation.  We  shall  now  consider  the  results  of  three  methods  of  considering  the 
relative  variability,  (a)  from  the  excesses  given  in  columns  1  and  2  of  Table  VIII.  ; 
(^)  from  the  grey  blue -green  range  given  in  columns  3  and  4  of  Table  VIII,  ;  and  (y) 
from  the  range  of  tints  3  to  6  inclusive  given  in  columns  5  and  6  of  Table  VIII.  As 
we  have  already  indicated,  these  methods  are  not  likely  to  give  the  same  relative 
magnitude  numerically  for  the  variabilities  ;  we  must  content  ourselves  if  they  agree 
in  making  the  ratio  of  cr,.  to  cr,,  greater  or  less  than  unity.  Now,  in  the  twenty-two 
cases 

a  and  /8  disagree  in  10  cases. 
ft  and  y  disagree  in  7  cases, 
a  and  y  disagree  in    5  cases. 

Further,  for  the  five  cases  in  which  a  and  y  disagree,  those  for  father  and  son, 
paternal  grandfather  and  grandson,  maternal  uncle  and  nephew,  show  so  little 
difference  of  variability  in  the  two  sexes  that  both  methods  give  sensibly  the  same 
results,  i.e.,  equality  of  variability.  In  the  cases  of  the  paternal  grandfather  and 
grandchildren,  the  two  methods  diverge  rather  markedly. 

It  will  be  of  interest  accordingly  to  work  out  the  probable  errors  as  given  by  the 
excess  method  for  one,  say  the  first  of  these  cases.  The  theory  is  given  in  Part  VII.  of 
the  present  series.  Here  =  275*165,  =  309*013,  whence  we  find  probable  error 
ofE^  =  17*273,  probable  error  of  E2  =  16*925,  correlation  between  errors  in  Ej^  and 
E3  =  —'4424,  probable  error  in  cTj/o-g  =  '0394. 

Thus  the  probable  error  in  the  ratio  of  the  variabilities  is  about  4  per  cent.,  and  of 
the  order  of  the  quantities  by  which  we  are  distinguishing  the  relative  size  of  o-^ 
and  cr^. 

Further,  there  is  another  source  of  error  in  evaluating  Ej  and  E^  due  to  the 
method  of  interpolation  used,  and  this  would  still  further  increase  the  probable  error 
in  cTj/cTg.  We  cannot  therefore  lay  any  very  great  stress  on  the  manner  in  which  the 
ratios  of  variabilities  for  the  paternal  grandfather  and  grandchildren  have  swung 
round  from  (a)  to  (y). 

A  further  examination  shows  us  that  in  all  five  cases  wherein  y  differs  from  a  it  is 
in  accord  with  y8.  I  shall  accordingly  take  y  as  the  standard  criterion,  but  in  those 
cases  in  which  it  has  agreement  with  a,  its  conclusions  must  be  given  greater  weight. 

(a.)  On  the  Relative  Variability  of  Sex  in  Eye-colour. — The  following  male  groups 
are  more  variable  than  the  corresponding  female  groups  : — 

Sons  of  fathers  than  daughters  of  fathers. 
Sons  of  mothers  than  daughters  of  mothers. 
Brothers  of  brothers  than  sisters  of  sisters. 
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Grandsons  (in  four  series)  than  granddaughters  (in  four  same  series). 
Nephews  (in  four  series)  than  nieces  (in  four  same  series). 
Fathers  (in  two  series)  than  mothers  (in  two  series). 
Grandfathers  (in  four  series)  than  grandmothers  (in  four  like  series). 
Uncles  (in  four  series)  than  aunts  (in  four  Hke  series). 

The  following  female  groups  are  more  variable  than  the  corresponding  male 
groups  : — 

Sisters  of  brothers  than  brothers  of  sisters. 
Wives  than  husbands. 

We  have  thus  21  series  with  male  preponderance  against  only  two  with  female 
preponderance  of  variability. 

Again,  the  mean  range  of  tints  3,  4,  5,  6  in  22  male  series  equals  1*5424  cr^,  and 
in  22  female  series  equals  1*6740  cr,p  or  we  have  enough  evidence  to  show  that  the 
ratio  of  male  to  female  variability  is  about  1*08.* 

This  greater  variability  of  the  male  in  eye-colour  is  of  considerable  interest.  It 
does  not  appear  to  be  a  result  of  sexual  selection,  for  so  far  as  our  comparatively  small 
series  weighs,  husbands  are  less  variable  than  wives.  That  mothers  are,  however,  less 
variable  than  fathers  seems  to  indicate  that  dark-eyed  women  are  more  fertile  f  than 
light-eyed,  for  we  must  bear  in  mind  that  mothers  have  on  the  average  a  darker  eye- 
colour  than  wives.  We  have  thus  again  reached  the  same  conclusion  as  before, 
namely,  that  a  dark-eyed  element  in  the  population  with  a  prepotent  fertility  is 
replacing  the  blue-eyed  element. 

The  other  female  exception  to  the  general  rule  of  greater  variability  in  the  eye- 
colour  of  the  male  is  that  in  mixed  families  the  sisters  appear  to  be  more  variable 
than  their  brothers,  notwithstanding  that  brothers  of  brothers  are  more  variable  than 
sisters  of  sisters.  In  other  words,  so  far  as  eye-colour  is  concerned  an  exceptional 
man  is  more  likely  to  have  brothers  than  sisters,  but  an  exceptional  woman  also  is 
more  likely  to  have  brothers  than  sisters.  The  inference  is  not  very  strong,  as  the 
excess  method  (a)  makes  brothers  of  sisters  and  sisters  of  brothers  of  sensibly 
equal  variability;  it  rests  therefore  on  (B)  and  (y)  only.  Still  it  deserves  fuller 
investigation. 

(6.)  Let  A  and  B  be  two  grades  of  relationship,  of  which  A  refers  to  the  older 
generation,  and  A  and  B  refer  to  either  sex.     Then  the  variability  of  all  the  As 

*  It  is  worth  noting  that  the  ratio  of  male  to  female  variability  in  the  coat-colour  of  horses  is  1-05  (see 
p.  96).  In  both  eases  the  female  is  darker,  i.e.,  has  less  of  "  colour  "  ;  thus  if  we  could  take  a  coefficient  of 
variation  ratio  instead  of  standard  deviation  ratio  as  the  test,  we  should  find  the  difference  of  variability 
less,  possibly  even  zero. 

t  For  if  mothers  are  to  be  less  variable  than  wives,  their  distribution  must  be  more  compressed  round 
the  mean  than  that  of  wives  ;  this  denotes  that  fertility  is  correlated  with  eye-coloiu^  and  the  darker  eye- 
colour  goes  with  the  greater  fertility.    [See  Note  II.  at  end  of  memoir,  however.] 
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who  have  female  B's  is  invariably  greater  than  the  variability  of  all  the  A's  who 
have  male  B's. 

The  law  appears  to  be  universal,  at  least  it  is  absolutely  true  for  all  the  10  cases  to 
which  we  can  apply  it.  Thus  the  father  of  sons  is  less  variable  than  the  father  of 
daughters,  the  maternal  grandmother  of  grandsons  less  variable  than  the  maternal 
grandmother  of  granddaughters,  or  the  paternal  uncle  of  nephews  less  variable  than 
the  paternal  uncle  of  nieces.  In  other  words,  although  women  appear,  in  eye-colour, 
to  be  less  variable  than  men,  they  spring  from  more  variable  stocks. 

This  law  is  a  remarkable  one,  but  in  face  of  the  evidence  for  it,  it  seems  difficult  to 
doubt  its  validity.  Should  it  be  true  for  more  characters  in  man  than  eye-colour,* 
the  conclusions  to  be  drawn  from  it  will  be  somewhat  far-reaching,  however  difficult 
it  may  be  to  interpret  its  physiological  significance. 

(c.)  On  the  Relative  Vay^iability  of  Different  Generations. — We  have  already  had 
occasion  to  refer  to  the  general  rule  that  the  older  generation  will  be  found  less 
variable  than  the  younger,  for  it  is  in  itself  a  selection,  namely,  of  those  able  to 
survive  and  reproduce  themselves.  But  this  rule  is  obscured  in  the  present  case  by 
several  extraneous  factors,  thus  : — 

(i.)  The  male  is  sensibly  more  variable  than  the  female,  consequently  it  is  quite 
possible  that  an  elder  male  generation  should  appear  more  variable  than  a  younger 
female  generation, 

(ii.)  There  appears  to  be  a  secular  change  in  eye-colour  going  on.  Thus  while  the 
grandparental  population  is  a  selection  from  the  general  population,  the  general 
population,  at  a  given  time,  is  a  selection  from  that  of  an  earlier  period. 

Thus,  taking  means  in  the  cases  of  the  grandparental  and  avuncular  relationships, 
we  have  from  (y)  the  following  results  : — 

The  father  is  more  variable  than  son  and  than  daughter. 

The  mother  is  less  variable  than  son  and  more  than  daughter. 

The  grandfather  is  more  variable  than  grandson  and  than  granddaughter. 

The  grandmother  is  less  variable  than  grandson  and  more  than  granddaughter. 

The  uncle  is  more  variable  than  nephew  and  more  than  niece. 

The  aunt  is  less  variable  than  nephew  and  more  than  niece. 

In  other  words,  the  older  generation  is  always  more  variable  than  the  younger, 
except  when  rule  (a),  that  the  male  is  more  variable  than  the  female,  comes  in  to 
overturn  this  law.  If  we  confine  ourselves  to  comparisons  of  the  same  sex  the  rule  is 
seen  to  be  universal. 

We  are  thus  forced  again  to  ask  for  an  explanation  of  the  decreasing  variability  of 
eye-colour,  and  can  only  seek  it  in  that  secular  change  we  have  several  times  had 

*  Fathers  of  daughters  are  more  variable  in  stature  than  fathers  of  sons  ('  Phil.  Trans.,'  A,  vol.  187, 
p.  274).  I  propose  to  reinvestigate  the  question  with  regard  to  mothers  from  the  material  of  my  family 
measuiement  cards,  which  is  far  more  extensive  than  the  material  I  had  at  my  disposal  in  1895. 
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occasion  to  refer  to.  Mean  and  standard  deviation  of  eye -colour  appear  to  have 
changed  sensibly  during  the  few  generations  covered  by  Mr.  Galton's  eye  data. 

It  is  difficult  to  understand  how  any  obscurity  about  the  recording  of  eye-colours 
could  lead  to  anything  but  chaos  in  the  numerical  results.  It  does  not  seem  to 
me  possible  that  such  results  as  we  have  reached  under  (a),  (b),  and  (c),  namely, 
greater  variability  in  the  male,  greater  variability  in  the  stock  of  the  female,  and 
secular  change  in  variability,  can  be  due  to  any  process  of  recording.  I  am  forced  to 
the  conclusion  that  they  are  peculiar  to  the  character  under  investigation,  and  are 
not  due  to  the  manner  of  taking  the  record  or  of  dealing  with  it  arithmetically.  I 
have  purposely  avoided  drawing  attention  to  small  differences  and  forming  any  con- 
clusions which  did  not  depend  on  whole  series  of  groups  and  substantial  averages. 

(13.)  On  the  Inheritance  of  Eye-colour,  {a.)  Assortative  Mating. — Before  we 
enter  on  the  problem  of  inheritance,  it  is  as  well  to  look  at  the  substantial 
correlation  obtained  between  the  eye-colour  in  husband  and  wife.  When  in  1895 
I  reached  the  value  "ODSl  i  '0473  for  stature,  I  wrote,  "we  are  justified  in  con- 
sidering that  there  is  a  definite  amount  of  assortative  mating  with  regard  to  height 
going  on  in  the  middle  classes."*  Since  then  we  have  worked  out  the  coefficients  of 
;  correlation  in  stature,  forearm,  and  span  for  1000  husbands  and  wives  (instead  of 
200)  from  my  family  dataf  cards.  The  results,  which  are  very  substantial,  will  be 
dealt  with  in  another  paper,  and  amply  confirm  my  view  that  assortative  mating  is 
very  real  in  the  case  of  mankind.  The  value  ('0931)  cited  above  is  in  close  agree- 
ment with  the  result  now  reached  ('1002  *0378)  for  eye-colour  in  the  same 
material.  The  correlation  between  husband  and  wife  for  two  very  divergent 
«  characters  is  thus  shown  to  be  about  '1,  or  is  25  per  cent,  greater  than  is  required 
between  Jirst  cousi7is'\.  by  the  Jaiv  of  ancestral  heredity. 

This  remarkable  degree  of  likeness  between  husband  and  wife — the  scientific 
demonstration  that  like  seeks  like — cannot  be  overlooked.  It  shows  that  sexual 
selection,  at  least  as  far  as  assortative  mating  is  concerned,  is  a  real  factor 
of  evolution,  and  that  we  must  follow  Darwin  rather  than  Wallace  in  this 
matter.  § 

(6.)  Collateral  Heredity.  First  Degree. — I  deal  first  with  this  form  of  heredity, 
as  it  offers  least  points  for  discussion.  The  values  of  the  correlation  '5169  for 
brothers,  and  "4463  for  sisters  and  sisters  are  considerably  less  than  what  we  have 
found  for  coat-colour  in  horses,  but,  like  the  value  '4615  for  brothers  and  sisters,  are 
substantially  greater  than  '4  to  be  expected  from  the  unmodified  Galtonian  law. 
They  could  be  reached  by  making  y  greater  than  unity  in  my  statement  of  the  law 
of  ancestral  heredity.  ||    They  could  also  be  given  by  the  law  of  exclusive  inheritance 

*  '  Phil.  Trans.,'  A,  vol.  187,  p.  273. 

t  See  also  '  Grammar  of  Science,'  second  edition,  pp.  429-437. 

X  '  Roy.  Soc.  Proc.,'  vol.  62,  p.  410.  §  '  Roy.  Soc.  Proc.,'  vol.  66,  p.  140  et  seq. 

li  I  have  considered  possible  explanations  of  this  apparently  large  assortative  mating  (i.)  in  both  stature 
VOL.  CXCV. — A.  Q 
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(see  p.  90)  with  a  certain  degree  of  prepotency  in  the  individual  pairing.  As  we 
have  already  noted,  collateral  inheritance  of  the  first  degree  alone  considered 
will  not  enable  us  to  discriminate  between  blended  and  exclusive  inheritance. 

We  note  that  the  male  in  collateral  inheritance  predominates  over  the  female, 
brothers  being  more  alike  than  sisters  in  eye-colour,  and  brother  and  sister  more 
alike  than  sister  and  sister.  The  mean  value  for  inheritance  in  the  same  sex  is, 
however,  greater  than  the  value  for  inheritance  between  opposite  sexes  {cf.  p.  102). 

(('.)  Collateral  Heredity.  Second  Degree. — A  very  cursory  inspection  of  the 
coefficients  of  correlation  for  the  eight  series  of  avuncular  relationships  shows  us 
that  it  is  quite  impossible  that  the  mean  value  should  be  "15  as  required  by  the 
Galtonian  Law.  The  average  value  of  the  avuncular  correlation  is  "2650,  and  of 
the  regression  of  nephew  and  niece  on  uncle  or  aunt  is  '2733.  The  probable  error 
of  the  former  result  will  not  be  more  than  '02,  and  of  the  latter  something  greater, 
as  the  ratio  of  the  variabilities  is  open  to  larger  error.  This  mean  value  is  accord- 
ingly, within  the  limits  of  errors  of  investigation,  identical  with  the  "25  to  be  expected 
on  the  theory  of  exclusive  inheritance.  It  is  a  value  which  appears  to  be  quite 
impossible  on  the  theory  of  blended  inheritance  even  with  my  generalised  form  of 
the  ancestral  law. 

We  may  draw  several  other  important  conclusions  from  our  table  of  avuncular 
correlations : — 

(i.)  Nephews  are  more  closely  related  to  both  uncles  and  aunts  than  nieces  are. 
This  is  true  in  each  individual  case,  whether  it  be  judged  by  correlation  or  regression. 
The  mean  correlations  for  uncles  and  for  aunts  are  as  '3081  to  "2219  respectively. 

(ii.)  Uncles  are  more  closely  related  to  nephews  and  nieces  than  aunts  are.  This 
is  true  for  three  out  of  the  four  individual  cases  ;  in  the  fourth  case  the  difference 
is  of  the  order  of  the  probable  error  of  the  difference.  The  mean  correlations  ot 
nephews  and  nieces  are  as  "2923  and  "2377  respectively. 

(iii.)  Paternal  uncles  and  aunts  are  more  closely  correlated  with  their  nieces  and 
nejihews  than  maternal  uncles  and  aunts.    The  mean  values  are  as  "2719  to  "2580. 

(iv.)  Resemblance  between  individuals  of  the  same  sex  is  closer  than  between 
individuals  of  opposite  sex.  The  mean  values  for  the  avuncular  correlation  in  the 
same  sex  and  in  the  opposite  sex  are  respectively  '2751  and  "2549. 

(v.)  Uncles  are  more  closely  related  to  nephews  than  aunts  to  nieces  (mean 
correlations  as  "3455  to  "2046).  In  fact,  generally,  we  see  a  very  considerable 
preponderance  of  heredity  in  the  male  line  so  far  as  these  avuncular  relations  for 

and  eye-colour,  being  characters  of  local  races,  or  even  families,  and  the  husband  seeking  his  wife  in  his 
own  locality  or  kin;  (ii.)  in  a  possible  correlation  of  homogamy  and  fertility.  See  'Roy.  Soc.  Proc.,' 
vol.  66,  p.  28.  Neither  seem  very  satisfactory.  Consciously  or  unconsciously,  man  and  woman  appear  to 
select  their  own  type  in  eye-colour  and  stature,  until  they  are  apparently  more  alike  than  such  close  blood 
relations  as  first  cousins  !  Until  we  know  how  far  this  correlation  extends  to  other  characters,  it  would, 
perhaps,  be  idle  to  draw  conclusions  as  to  its  bearing  on  widely  current  views  as  to  first  cousin  marriages. 


MATHEMATICAL  CONTRIBUTIONS  TO  THE  THEORY  OF  EVOLUTION.  115 


eye-colour  extend.  It  is  noteworthy  that  while  the  two  highest  correlations  are 
reached  for  nephew  with  paternal  and  with  maternal  uncles,  nearly  the  two  lowest 
are  found  for  niece  with  paternal  and  with  maternal  aunts.  Without  laying 
special  stress  on  each  small  difference,  it  must  be  admitted  that  the  avuncular 
correlations  vary  in  a  remarkable  manner  with  sex,  and  differ  very  widely  from 
the  practical  equality  of  resemblance  which  we  might  d  priori  have  expected  to 
exist  in  this  relationship. 

{d.)  Direct  Heredity.  First  Degree. — Here  we  have  a  mean  value  of  the  paternal 
correlation  =  -4947.  This  is  in  excellent  agreement  with  the  '5  to  be  expected  by 
our  theory  of  exclusive  inheritance  ;  it  is  thus  in  practical  agreement  with  the  value 
of  the  parental  correlation  obtained  for  the  inheritance  of  coat-colour  in  horses.  It 
would  not  be  inconsistent  with  a  high  value  for  y  in  the  theory  of  blended  inheri- 
tance, but  such  a  value  of  y  is  rendered  impossible  by  the  values  we  have  obtained 
for  collateral  heredity  (see  '  Roy.  Soc.  Proc.,'  vol.  66,  p.  140  et  seq.). 

We  may  draw  the  following  special  conclusions  : — (i.)  The  son  inherits  more 
strongly  from  his  parents  than  the  daughter,  the  mean  correlations  are  as  '5160 
to  "4733 ;  (ii.)  The  son  inherits  more  strongly  from  his  father  than  his  mother, 
and  the  daughter  more  strongly  from  her  mother  than  her  father. 

This  is  part  of  the  general  principle  which  we  have  seen  to  hold,  namely  :  that 
change  of  sex  weakens  the  intensity  of  heredi'ty. 

The  correlation  of  father  and  daughter  appears  to  be  abnormally  below  the  other 
three,  but  something  of  the  same  kind  has  been  noted  in  certain  stature  data  ;  as  it 
is,  the  high  correlation  of  father  and  son  renders  the  mean  paternal  correlation  with 
offspring  ('4936)  sensibly  equal  to  the  mean  maternal  correlation  ("4956). 

(e.)  Direct  Heredity.  Second  Degree. — If  we  take  the  mean  value  of  the  eight 
grandparental  correlations,  we  find  it  equals  '3164,  while  the  mean  value  of  the 
regression  of  offspring  on  their  grandparents  is  "3136.  These  results  are  absolutely 
incompatible  with  the  "15  required  by  Mr.  Galton's  unmodified  theory,  and  they  in 
fact  put  the  theory  of  blended  inheritance  entirely  out  of  court.  At  the  same  time, 
unlike  the  cases  of  parental,  avuncular,  and  fraternal  inheritance,  they  cannot  be  said 
to  be  in  good  agreement  with  the  value  "25  required  by  the  theory  of  exclusive 
inheritance.  We  have  to  admit  that  our  grandparental  data  are  shorter  series  than 
in  the  other  cases,  and  that  guesses  as  to  grandparents'  eye-colour,  based  on  memory, 
miniatures,  &c.,  were  more  likely  to  be  made.  Further,  such  guesses  might  easily 
be  biased  by  a  knowledge  of  the  eye-colour  of  more  recent  members  of  the  family. 
Still  a  reduction  from  '32  to  '25  is  a  very  large  reduction,  and  we  have  to  remember 
that  for  long  series  in  the  case  of  the  thoroughbred  horses,  with  no  such  guessing  at 
colour  as  may  occur  with  ancestors'  eyes,  we  found  "3353  for  the  maternal  grand- 
sires,  a  result  in  excellent  agreement  with  the  '3343  found  for  the  maternal  grand- 
fathers in  the  present  case.  Thus  while  the  theory  of  exclusive  inheritance  ivithout 
reversion  suffices  to  describe  the  quantitative  values  we  have  found  for  the  parental, 
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the  avuncular  and  the  fraternal  correlation  in  the  cases  of  both  horse  and  man,  it  is 
yet  in  both  these  cases  unsatisfactory  so  far  as  the  grandparental  inheritance  is  con- 
cerned. It  may  be  imagined  that  if  we  allowed  for  reversion,  we  might  emphasise 
the  grandparental  correlation  beyond  the  value  "25  suggested  by  theory.  But  I 
have  shown  in  my  memoir  on  the  "  Law  of  Reversion,"  that  with  the  parental  correla- 
tion as  high  as  '5,  we  cannot  hope  to  have  the  grandparental  correlation  even  with 
reversion  higher  than  -25.  (See  '  Roy.  Soc.  Proc.,'  vol.  66,  p.  140  et  seq.)  Clearly  the 
values  obtained  for  grandparental  correlation  in  this  paper— the  first  I  believe 
hitherto  investigated — seem  to  present  anomalies  which  our  theory  of  blended 
inheritance  totally  fails  to  account  for,  and  which  may  require  some  modification  of 
our  views  on  reversion  before  we  can  meet  them  on  our  theory  of  exclusive 
inheritance. 

I  note  the  following  general  results  deduced  from  our  values  of  the  grandparental 
correlations  : — 

(i.)  Grandsons  are  more  closely  correlated  with  both  grandparents  than  grand- 
daughters are.  This  is  true  for  three  out  of  the  four  cases ;  the  exception,  maternal 
grandmother,  is  covered  by  another  rule  (iv.).  The  mean  correlation  for  grandparents 
and  grandsons  is  '3294,  and  for  grandparents  and  granddaughters  '3039. 

(ii.)  Grandfathers  are  more  closely  correlated  with  grandchildren  than  grand- 
mothers are.  This  is  true  in  three  out  of  the  four  cases,  the  fourth  being  again 
subject  to  rule  (iv.).  The  mean  correlations  for  grandfathers  and  grandmothers  are 
'3675  and  '2658  respectively. 

(iii.)  Paternal  grandparents  appear  to  be  more  closely  correlated  with  their 
grandchildren  than  maternal  grandparents,  the  average  values  of  the  two  correlations 
being  "3236  and  "3097  respectively. 

(iv.)  Resemblance  between  individuals  of  the  same  sex  is  closer  than  between 
individuals  of  the  opposite  sex.  The  mean  values  for  the  grandparental  and 
grandchild  correlation  in  the  same  and  the  opposite  sexes  are  '3329  and  "3004 
respectively. 

(v.)  Grandfathers  are  more  closely  related  to  grandsons  than  grandmothers  to 
granddaughters,  the  mean  correlations  being  as  "3965  and  '2693  respectively.  It 
will  be  noted  at  once  that  these  five  rules  are  identical  with  those  we  have  obtained 
for  the  avuncular  correlations.  So  that  there  is  small  doubt  that  they  are  general 
rules  relating  to  all  grades  of  relationship  for  this  character. 

It  seems  to  me  probable  that  the  correct  form  of  (iii.)  is  :  Paternal  grandfathers  are 
more  highly  correlated  with  grandchildren  ("4006)  than  maternal  grandfathers  ("3343), 
and  paternal  grandmothers  ("2468)  less  highly  correlated  than  maternal  grandmothers 
('2851).  I  have  not  stated  the  rule  in  this  form,  because  it  is  not  confirmed  by 
the  corresponding  results  for  uncles  and  aunts.  Paternal  uncles  ('3024)  are  more 
closely  correlated  with  nephews  and  nieces  than  maternal  uncles  ('2722),  but  paternal 
aunts  (•2414)  are  slightly  more  instead  of  less  correlated  witli  nephews  and  nieces 
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than  maternal  aunts  (•2338).  I  consider,  however,  that  the  correlation  of  paternal 
aunt  and  nephew  (•2837)  in  our  series  is  abnormally  high. 

Now  it  will,  I  believe,  be  seen  that  the  investigation  of  the  eight  avuncular  and 
the  eight  grandparental  relationships,  here  made  for  the  first  time,*  enables  us  to 
draw  far  wider  conclusions  than  when,  as  hitherto,  only  parental  and  fraternal  corre- 
lations are  dealt  with.  In  making  the  subjoined  general  statements,  however,  I 
must  emphasise  the  following  limitations  : — 

(a.)  The  rules  are  deduced  only  from  data  for  one  character  in  one  type  of  life. 

(yS.)  This  character  appears  to  be  undergoing  a  secular  change,  a  change  very 
possibly  due  to  a  correlation  between  eye-colour  and  fertility  in  woman.  Thus  such 
a  change  might  not  unlikely  differentiate  the  male  and  female  influences  in  heredity. 

My  conclusions,  definitely  true  for  eye-colour  in  man,  and  at  the  very  least 
suggestive  for  investigations  on  other  characters  in  other  types  of  life,  are  : — 

(i.)  That  the  younger  generation  takes,  as  a  whole,  more  after  its  male  than  its 
female  ascendants  and  collaterals. 

(ii.)  That  the  younger  generation  is  more  highly  correlated  with  an  ascendant  or 
collateral  of  the  same  than  of  the  opposite  sex. 

(iii.)  That  the  younger  generation  is  more  highly  correlated  with  an  ascendant  or 
higher  collateral  reached  by  a  line  passing  through  one  sex  only  than  if  the  line 
changes  sex. 

Thus  correlation  is  greater  with  a  paternal  uncle  than  with  a  maternal  uncle,  or 
with  a  maternal  grandmother  than  a  paternal  grandmother, 

(iv,)  Males  are  more  highly  correlated  with  their  ascendants  and  collaterals  than 
females  are. 

The  above  rules  apply  to  the  averages ;  individual  exceptions  will  be  generally 
found  to  arise  from  a  conflict  of  rules.  Thus  (ii.)  and  (iii,)  may  in  special  cases  come 
into  conflict  with  (i,).  When  we  have  more  data  for  a  greater  variety  of  characters, 
we  shall  see  better  the  relative  weight  of  these  rules  in  cases  where  they  conflict, 

(/)  Exclusive  Inheritance  in  Eye-Colour. — A  cursory  examination  of  the  eye- 
colour  records  shows  at  once  how  rare  is  a  blend  of  the  parental  tints.  Even  when 
such  is  recorded,  it  is  by  no  means  clear  that  we  have  not  to  deal  with  a  medium  tint 
which  is  really  a  case  of  reversion  to  a  medium  tinted  ancestor.  The  failure  of  eye- 
colour  to  blend  is,  I  think,  well  illustrated  by  what  Mr,  Galton  has  termed  cases  of 
"  particulate  "  inheritance.  In  the  thousands  of  eye-colours  I  have  been  through,  I 
noticed  some  half-dozen  cases  only  in  which  the  two  eyes  of  the  same  individual  were 
of  diflerent  tint,  or  the  iris  of  one  pupil  had  streaks  of  a  second  tint  upon  it.  t 

*  I  anticipate  equally  valuable  results  when  characters  are  first  correlated  for  the  nine  possible  cousin 
series. 

t  In  the  same  manner  the  occurrence  of  particulate  inheritance  in  coat-colour  in  horses  may  be  really 
an  argument  against  the  existence  of  blends.  In  the  many  volumes  of  the  studbooks  I  have  examined, 
the  recorded  instances  of  piebalds  are  vanishingly  few  in  numljer. 
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If  we  allow  that  it  is  from  the  theory  of  exclusive  inheritance  that  we  must  seek 
results  in  the  present  cases,  we  see  that  for  parental,  collateral,  and  avuncular  relation- 
ships we  get  quite  excellent  results,  but  that  the  grandparental  relationship  is  some- 
what anomalous.  A  priori  it  might  appear  that  reversion  would  aid  us  in  increasing 
the  correlation  between  offspring  and  remote  ascendants.  But,  as  I  have  shown  else- 
where,* this  superficial  view  of  reversion  forgets  that  the  parents  as  well  as  the 
offspring  revert,  and  if  we  increase  the  grandparental  correlation  above  "25,  we  at  once 
reach  difficulties  in  the  values  of  the  parental  correlation,  provided  we  adopt  what 
appear  to  be  reasonable  assumptions  as  to  reversion  being  a  continuous  and  decreasing 
factor  from  stage  to  stage  of  ancestry.  I  am  inclined  accordingly  to  suspend  judg- 
ment on  the  grandparental  relationships,  thinking  that  the  smallness  of  the  number 
of  families  dealt  with  in  Mr.  Galton's  data  (200)  may  have  something  to  do  with  my 
peculiar  results.  Meanwhile  I  shall  endeavour  to  get  the  remaining  six  grandparental 
tables  for  thoroughbred  horses  worked  out,  and  see  whether  they  confirm  the  high 
values  already  found  for  the  two  maternal  grandsires  and  offspring,  or  give  an  average 
value  much  nearer  "25. 

That  the  reader  may  see  at  a  glance  the  general  results  hitherto  obtained  in  this 
and  other  papers,  I  append  the  following  table  of  inheritance  : — 


*  See  my  paper  on  "  The  Law  of  Reveisioii,"  'Roy.  See.  Proc.,'  vol.  66,  p.  140  et  seq.  Also  'The 
Grammar  of  Science,'  second  edition,  1900,  pp.  486-96,  "  On  Exclusive  Inheritance." 
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Table  IX. — Theoretical  and  Actual  Results  for  Inheritance. 


Theory. 

Man. 

Horse. 

Hound. 

Daphnia. 

Kelationsnip. 

Blended 
inherit- 
ance.^ 

Exclusive 
inherit- 
ance.'^ 

Stature^. 

ileacl 
index.* 

Lye-^ 
colour.'^ 

Coat- 
colour.''' 

Coat- 
colour." 

Spine.^ 

Parental  .... 

•3000 

•5000 

•3355 

•3348 

•4947 

•5216 

•3507 

[•3295] 

Mid-parental .    .  . 

•4242 

•4745 

•4735 

— 

— 

•4660 

Grandparental  . 

•1.500 

•2500 

•3166 

•3353 

•1340 

[•1360] 

G.  Grandparental  . 

•0750 

•1250 

•0404 

Avuncular  . 

•1500 

•2500 

■2650 

Whole  sibling    .  . 

•4000 

•4  to  1-0 

•4034 

•4025 

•4749 

•6329 

•5170 

•6934 

Half  sibling  ,  . 

•2000 

•2  to  •S 

•3550 

•1646 

1  Mr.  Galton's  unmodified  hypothesis.  See  "  Law  of  Ancestral  Heredity,"  '  Roy.  Soc.  Proc.,'  vol.  62 
p.  397. 

2  "Without  any  reversion.  See  "  Law  of  Reversion,"  '  Roy.  Soc.  Proc.,'  vol.  66,  p.  140  et  seq.  The  values 
for  the  fraternal  correlation  depend  on  the  degree  of  prepotency  of  either  parent  within  the  union. 

3  See  '  Phil.  Trans.,'  A,  vol.  187,  p.  270. 

See  'Roy.  Soc.  Proc.,'  vol.  62,  p.  413.  The  paternal  correlations,  for  reasons  stated  in  the  paper,  are 
excluded  from  the  result. 

^  See  p.  113     seq.  of  the  present  memoir. 

^  See  p.  98  et  seq.  of  the  present  memoir.    The  grandparental  correlation  is  based  on  two  cases  only. 
"  See  'Roy.  Soc.  Proc.,'  vol.  66,  p.  140  et  seq. 

^  See  'Roy.  Soc.  Proc.,'  vol.  65,  p.  154.  I  have  deduced  the  value  for  parents  and  grandparents  from 
Dr.  Warren's  results  for  midparent  and  midgrandparent.  The  value  for  whole  siblings  I  obtained  from 
Dr.  Warren's  measurements,  which  he  with  great  kindness  placed  at  my  disposal. 

(14.)  Conclusions. — The  course  of  this  investigation  has  not  been  without  diffi- 
culties, and  I  am  fully  prepared  to  admit  that  more  obscurity  and  greater  probable 
errors  are  likely  to  arise  when  we  deal  with  the  inheritance  of  a  character  not  directly 
measurable,  than  when  we  take  that  of  a  character  to  which  we  can  at  once  apply  a 
quantitative  scale.  But  I  contend  that  many  of  the  characters,  the  inheritance  of 
which  it  is  most  important  to  investigate,  do  not  at  present,  and  perhaps  never  will, 
admit  of  a  quantitative  measurement.  We  can  arrange  in  order,  we  can  classify,  we 
can  say  more  or  less  intense,  but  we  cannot  read  off  value  on  a  scale.  It  is  just  such 
characters  also,  which  the  not  highly  trained  observer  can  most  easily  appreciate  and 
record.  Hence  we  have  been  compelled  to  devise  some  method  of  dealing  with  them, 
and  the  present  paper  illustrates  how  the  method  invented  can  be  applied  to  reach 
results  of  considerable  interest  and  of  substantial  validity. 
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111  order  to  illustrate  the  method,  I  chose  two  characters,  coat-colour  in  horses  and 
eye-colour  in  man,  which  seemed  sufficiently  diverse  both  as  to  origin  and  species,* 
The  new  method  enabled  me  to  reach  results  for  half-brethren,  grandparents  and 
uncles  and  aunts,  which  had  not  yet  been  independently  considered.  The  conclu- 
sions arrived  at  for  eye-colour  in  man  at  no  point  conflict  with  those  for  coat-colour 
in  horses,  and  both  in  the  main  accord  with  the  theory  of  exclusive  inheritance  with- 
out reversion  herein  developed.    We  find — 

(i.)  No  approach  to  a  single  value  for  the  coefficient  of  inheritance  for  each  grade 
of  relationship ;  it  varies  widely  with  the  sex,  and  the  line  through  which  the 
relationship  is  traced. 

(ii.)  No  approach  in  average  values  to  those  which  would  be  indicated  by  Mr. 
Galton's  Law. 

Nor  does  the  modification  of  Mr.  Galton's  Law,  which  I  have  termed  the  Law  of 
Ancestral  Heredity,  give  better  results.  For,  if  we  cause  it  to  give  the  parental 
values,  it  then  renders  results  inconsistent  with  the  fraternal  values. 

(iii.)  There  is  agreement  with  the  theory  of  exclusive  inheritance  without  reversion 
for  the  parental,  avuncular  and  fraternal  series  ;  but  there  is  some  anomaly  in  the 
case  of  grandparental  inheritance.  This  requires  further  investigation,  and  possibly 
a  modification  of  our  views  on  the  nature  of  reversion. 

We  want  a  list  formed  of  characters  in  various  types  of  life,  which  are  supposed  to 
be  exclusively  inherited,  and  then  experiments  ought  to  be  made  and  statistics  col- 
lected with  regard  to  these  characters.  It  is  in  this  field  of  exclusive  inheritance 
that  we  must  look  for  real  light  on  the  problem  of  reversion. 

If  we  consider  the  three  known  forms  of  inheritance,  the  blended,  the  exclusive, 
and  the  particulate  (which  may  possibly  be  combined  in  one  individual,  if  we  deal 
with  different  organs) ;  if  we  consider  further  that  these  forms  may  possibly  have  to 
be  supplemented  by  others  not  yet  recognised  {e.g.,  reversional  theories  depending, 
say,  on  heterogamous  unions),  then  it  would  appear  that  the  time  is  hardly  ripe  even 
for  provisional  mechanical  theories  of  heredity.  What  we  require  to  know  first  is, 
the  class  of  organs  and  the  types  of  life  for  which  one  or  other  form  of  inheritance 
predominates.  As  variation  in  no  wise  depends  on  the  existence  of  tico  germ-plasms, 
so  biparental  heredity  can  by  no  means  be  treated  as  the  result  of  their  simple  quanti- 
tative mixture  ;  the  component  parts  of  these  germ-plasms  corresponding  to  special 
characters  and  organs,  must  be  able  to  act  upon  each  other  in  a  variety  of  qualita- 
tively different  ways.  To  adopt  for  a  moment  the  language  of  Darwin's  theory  of 
pangenesis,  the  multiplying  gemmules  from  an  organ  in  the  father  must  (i.)  cross  with 
gemmules  from  that  organ  in  the  mother,  and  the  hybrid  gemmules  give  rise  to 
blended  inheritance,  (ii.)  must  without  crossing  multiply  alongside  the  gemmules  of 
the  mother,  and  give  rise  to  particulate  inheritance,  (iii.)  must  alone  survive,  or  alone 

*  Since  supplemented  by  my  investigations,  based  on  Mr.  Galton's  data,  for  coat-colour  in  hounds, 
'  Roy.  Soc.  Proc.,'  vol,  66,  p.  140  d  xeq. 
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be  destroyed  in  a  struggle  for  existence  with  those  of  the  mother,  and  give  rise  to 
exclusive  inheritance.  And  all  these  three  processes  may  be  going  on  within  the 
same  germ-plasm  mixture  at  the  same  time  !  Even  without  using  the  language  of 
gemmules,  processes  analogous  to  the  above  must  be  supposed  to  take  place.  Thus  a 
quantitative  "  mixture  of  germ -plasms  "  becomes  a  mere  name,  screening  a  whole 
range  of  mechanical  processes ;  and  very  possibly  a  new  one  could  be  found  for  each 
new  form  of  heredity  as  it  occurs.  Such  processes  like  the  old  ones  would  still 
remain  without  demonstrable  reality  under  the  veil  of  "  mixture  of  germ-plasms." 

What  I  venture  to  think  we  require  at  present  is  not  a  hypothetical  plasmic 
mechanics,  but  careful  classifications  of  inheritance  for  the  several  grades  of  rela- 
tionship, for  a  great  variety  of  characters,  and  for  many  types  of  life.  This  will 
require  not  only  the  formation  of  records  and  extensive  breeding  experiments,  but 
ultimately  statistics  and  most  laborious  arithmetic.  Till  we  know  what  class  of 
characters  blend,  and  v^^hat  class  of  characters  is  mutually  exclusive,  we  have  not 
within  our  cognizance  the  veriest  outlines  of  the  phenomena  which  the  inventors  of 
plasmic  mechanisms  are  in  such  haste  to  account  for.  Such  inventors  are  like  planetary 
theorists  rushing  to  prescribe  a  law  of  attraction  for  planets,  the  very  orbital  forms  of 
which  they  have  not  first  ascertained  and  described.  Without  the  observations  of 
Tycho  Brahe,  followed  by  the  arithmetic  of  Keplee,  no  Newton  had  been  possible. 
The  numerical  laws  for  the  intensity  of  inheritance  must  first  be  discovered  from  -wide 
observation  before  plasmic  mechanics  can  be  anything  but  the  purest  hypothetical 
speculation. 

Appendix  I. 

Tables  of  Colour  Inheritance  in  Thoroughbred  Eacehorses,  extracted  by  Mr.  Leslie 
Beamley-Mooee  from  Weatherby's  Studbooks. 


Table  of  Coloues. 


1  =  black  (bl.) 

2  =  black  or  brown  (bl./br.). 

3  =  brown  or  black  (br./bl.). 

4  =  brown  (br.). 

5  =  brown  or  bay  (br./b.). 

6  =  bay  or  brown  (b./br.). 

7  =  bay  (b.). 

8  =  bay  or  chestnut  (b./ch.). 


9  =  chestnut  or  bay  (ch./b.). 

10  =  chestnut  (ch.). 

11  =  chestnut  or  roan  (ch./rc). 

12  =  roan  or  chestnut  (ro./ch.). 

13  =  roan  (ro.). 

14  =  roan  or  grey  (ro./gr.). 

15  =  grey  or  roan  (gr./ro.) 

16  =  grey  (gr.). 
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0 

0 

0 

CO 

(M 

0 

to 

lO 

10 
10 

0 

CO 
(M 

0 

0 

0 

0 

0 

0 

10 
10 

r-H 

CO 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 — 1 

0 

0 

10 

I-H 

0 

T— 1 

0 

0 

0 

0 

0 

0 

r-H 

CO 

I-H 

0 

to 

I-H 

00 

0 

0 

0 

0 

0 

0 

0 

CO 

bl./br. 

br./bl. 

u 

br./b. 

b./br. 

rd 

b./ch. 

-d^ 

rd 

0 

-d 

0 

ch./ro. 

ro./ch. 

d 
u 

ro./gr. 

gr./ro. 

u 

&c 

I— 1 

C-7 

CO 

10 

CO 

00 

05 

0 

1 — 1 

CO 

lO 

CO 

Totals... 
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Colts. 


o 

I— I 

w 

®  CC 

CQ 

CO  g 
6  ^ 

P  ^ 

t  ^  ^ 


o3 


02 


O 


Totals. 

00 

1 — 1 

1 — 1 

o 

I— 1 

1  — 

to 

cq 

OS 

cq 

CI 

o 

CO 

1000 

I— 1 

o 

o 

o 

o 

o 

T-H 

C-1 

o 

o 

o 

O 

o 

o 

o 

o 

1 — 1 

lO 
1— 1 

gr./ro. 

o 

o 

o 

1 — 1 

o 

o 

o 

o 

o 

O 

o 

o 

o 

o 

o 

cq 

r— 1 

ro./gr. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

CO 
r— 1 

d 

o 

o 

o 

T—H 

o 

o 

CO 

o 

o 

l-H 

o 

o 

o 

o 

o 

o 

in 

I-H 

ro./ch. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

r— 1 
1—1 

ch./ro. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

O 
1 — 1 

ch. 

o 

o 

CI 
CO 

l-H 

»o 

cq 

l-H 

o 

o 

to 

l-H 

o 

o 

I-H 

o 

o 

o 

cq 

oj 

ch./b. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

O 

00 

b./ch. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

O 

rd 

o 

I-H 

1^ 

o 

00 
I — 1 

CO 
to 
o^ 

o 

o 

in 

cq 

l-H 

o 

o 

l-H 

o 

T-H 

l-H 

OS 

b./br. 

o 

o 

o 

00 

l-H 

o 

l-H 

cq 
cq 

o 

o 

in 

l-H 

o 

o 

o 

o 

o 

in 

id 

br./b. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

O 

o 

o 

o 

o 

o 

o 

O 

cq 

I— 1 

CO 

CO 

to 

o 

o 

00 

o 

o 

o 

o 

o 

o 

in 

r-H 

CO 

br./bl. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

cq 

bl./br. 

o 

o 

o 

o 

l-H 

o 

00 

o 

o 

cq 

o 

o 

o 

o 

o 

o 

l-H 

r— 1 

l-H 

o 

l-H 

o 

CO 

o 

o 

in 

o 

o 

o 

o 

o 

o 

o 

o 

CO 
l-H 

3 

u 

!3 

br./bl. 

br./b. 

b./br. 

nd 

b./ch. 

rd 

o 

rd 

o 

ch./ro. 

ro./ch. 

d 

ro./gr. 

gr./ro. 

bO 

i-Hcqeo-^intot^oooiOi-Hcqco^into 
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Colts. 


u 

03 

CQ 
•  I— t 

CO 
© 

'~6 


d 

-1-3  (D 

O  tl 

t  1 

ft 

.rH  O 

o 

©  .2 

M  a 

?-l 

03 
-M 


X 

pq 

e1 


Totals. 

o 

I-H 

00 
CO 

I-H 

I—I 

CO 

(M 

Oi 

o 

o 

I— 1 

I— 1 

o 

o 

o 

o 

1000 

CO 

ac 

O 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

lO 

gr./ro. 

o 

o 

o 

r— 1 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

1—1 

ro./gr. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

CO 
1— ( 

d 

o 

o 

o 

o 

o 

o 

o 

o 

o 

1 — 1 

o 

o 

o 

o 

o 

o 

I— 1 

T-H 

ro./ch. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

I-H 

ch./ro. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

rd 

O 

CO 

1 — 1 

o 

o 

CO 

o 

o 

T— 1 

1 — 1 

o 

o 

o 

o 

o 

(M 

CM 
lO 
(M 

as 

ch./b. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

00 

b./ch. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

rd 

CO 

1 — 1 

o 

CD 

in 

CO 
CO 

o 

o 

OS 

I— 1 

o 

o 

o 

o 

<M 

o 

CM 

CO 

b./br. 

o 

o 

o 

o 

CO 

rH 

o 

o 

o 

o 

o 

o 

o 

CM 

o 

CO 

br./b. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

>o 

o 

o 

CO 
CO 

o 

CO 
CO 

o 

o 

CO 
CM 

o 

o 

o 

o 

o 

o 

<M 
CO 

1 — 1 

CO 

br./bl. 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

cm' 

o 

<?q 

o 

1—1 

o 

o 

ip 

o 

o 

o 

o 

o 

o 

o 

1 — 1 

o 

1—5 

CO 

o 

I— 1 

CO 

o 

1 — 1 

00 

o 

o 

1—1 

o 

o 

o 

o 

o 

o 

o 

<M 

bl./br. 

br./bl. 

br./b. 

br./b. 

rd 

b./ch. 

ch./b. 

rd 

o 

d 

rd 

o 

ro./ch. 

d 

ro./gr. 

gr./ro. 

I-H 

CO 

in 

CO 

00 

CI 

o 

r-H 

1 — 1 
I — 1 

<M 
] — 1 

CO 
rH 

-* 
1 — 1 

CO 

I— 1 

Totals  ... 
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Appendix  II. 


Tables  of  Eye-colour  Inheritance  in  Man,  extracted  by  Kael  Pearson  from 
Mr.  Francis  Galton's  Family  Records. 


Table  of  Tints. 

1  =  light  blue.  5  =  light  brown. 

2  =  blue,  dark  blue.  6  =  brown. 

3  =  blue-green,  grey.  7  =  dark  brown. 

4  =  dark  grey,  hazel.  8  =  very  dark  brown,  black. 

This  grouping  is  not  quite  in  keeping  with  more  recent  divisions  of  eye-colour,  but 
being  that  adopted  by  Mr.  Galton  in  his  original  collection  of  data,  it  could  not  be 
modified  in  accordance  with  present  practice. 

Tables  for  the  Direct  Inheritance  of  Eye-colour.    First  Generation. 
I. — Fathers  and  Sons.    1000  Cases. 


Fathers. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

9 

12 

5 

5 

0 

1 

2 

0 

34 

2 

10 

163 

65 

36 

1 

7 

15 

4 

301 

3 

10 

73 

124 

41 

1 

12 

18 

5 

284 

4 

4 

21 

34 

55 

0 

11 

11 

1 

137 

5 

0 

0 

0 

0 

2 

2 

1 

0 

5 

6 

1 

26 

12 

19 

1 

19 

16 

6 

100 

7 

1 

23 

16 

14 

0 

11 

31 

2 

98 

8 

1 

4 

8 

10 

0 

1 

7 

10 

41 

Totals 

36 

322 

264 

180 

5 

64 

101 

28 

1000 

II. — Fathers  and  Daughters.    1000  Cases. 


Fathers. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

4 

9 

11 

4 

0 

1 

2 

1 

32 

2 

11 

139 

57 

31 

0 

6 

24 

5 

273 

3 

9 

73 

111 

38 

1 

15 

19 

3 

269 

4 

5 

43 

34 

54 

2 

10 

14 

3 

165 

5 

0 

1 

3 

0 

0 

3 

1 

0 

8 

6 

1 

45 

13 

19 

0 

23 

15 

3 

119 

7 

2 

27 

10 

12 

0 

7 

41 

6 

105 

8 

0 

8 

4 

0 

0 

2 

11 

4 

29 

Totals 

32 

345 

243 

158 

3 

67 

127 

25 

1000 
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III. — Mothers  and  Sons.    1000  Cases. 


Mothers. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

5 

14 

6 

3 

0 

1 

0 

6 

35 

2 

12 

119 

83 

29 

8 

20 

21 

9 

301 

3 

13 

54 

113 

35 

4 

37 

14 

8 

278 

4 

3 

21 

26 

54 

1 

17 

6 

6 

134 

5 

0 

1 

1 

0 

0 

3 

0 

0 

5 

6 

1 

9 

26 

10 

1 

30 

24 

3 

104 

7 

1 

9 

19 

16 

0 

18 

31 

7 

101 

8 

0 

7 

15 

4 

1 

3 

5 

7 

42 

Totals 

35 

234 

289 

151 

15 

129 

101 

46 

1000 

IV. — Mothers  and  Daughters.    1000  Cases. 


Mothers. 


Tints. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

j  Totals. 

1 

5 

15 

3 

2 

2 

2 

2 

0 

31 

2 

7 

99 

67 

29 

2 

15 

23 

13 

255 

3 

7 

77 

111 

38 

1 

26 

14 

6 

280 

4 

5 

22 

34 

46 

2 

27 

21 

7 

164 

5 

0 

2 

2 

0 

3 

1 

2 

1 

11 

6 

0 

13 

27 

20 

1 

35 

17 

7 

120 

7 

0 

13 

21 

16 

1 

19 

26 

9 

105 

8 

1 

5 

7 

2 

1 

4 

12 

2 

34 

Totals. 

25 

246 

272 

153 

13 

129 

117 

45 

1000 

Tables  for  the  Collateral  Inheritance  of  Eye-colour. 
V^  —  Brothers  and  Brothers.    1500  Cases. 

First  Brother. 


First. 

1. 

2. 

3. 

4. 

5. 

6. 

7 

8. 

Totals. 

i 

1 

8 
36 

2 

3 

4 

0 

0 

0 

19 

2 

202 

23 

17 

0 

6 

4 

3 

291 

3 

16 

182 

209 

26 

0 

4 

2 

2 

441 

4 

6 

36 

71 

84 

0 

7 

2 

0 

206 

5 

0 

3 

2 

1 

0 

1 

0 

0 

7 

6 

3 

56 

50 

39 

0 

34 

5 

6 

193 

7 

6 

37 

76 

48 

1 

36 

36 

2 

242 

8 

.  4 

24 

26 

18 

0 

8 

6 

15 

101 

Totals 

79 

542 

460 

237 

1 

96 

55 

30 

1500  j 
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Yl\ — Sisters  and  Sisters.    1500  Cases. 


First  Sister. 


First. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8.      ■  Totals. 

j 

1 

10 

2 

1 

1 

0 

0 

0 

0 

14 

2 

17 

147 

29 

6 

0 

10 

6 

2 

217 

3 

10 

136 

186 

24 

0 

9 

5 

3 

373 

4 

3 

75 

94 

66 

0 

1 

10 

0 

249 

5 

0 

3 

5 

2 

2 

0 

1 

0 

13 

6 

2 

57 

69 

55 

5 

52 

9 

2 

251 

7 

4 

56 

61 

52 

10 

59 

49 

0 

291 

8 

2 

20 

10 

7 

6 

13 

26 

8 

92 

Totals 

48 

496 

455 

213 

23 

144 

106 

15 

1500 

V'. — Brothers  and  Brothers.    Symmetrical  System. 


Tint. 

i 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

16 

38 

19 

10 

0 

3 

6 

6 

98 

2 

38 

404 

205 

53 

3 

62 

41 

27 

833 

3 

19 

205 

418 

97 

2 

54 

78 

28 

901 

4 

10 

53 

97 

168 

1 

46 

50 

18 

443 

5 

0 

3 

2 

1 

0 

1 

1 

0 

8 

6 

3 

62 

54 

46 

1 

68 

41 

14 

289 

7 

6 

41 

78 

50 

1 

41 

72 

8 

297 

8 

6 

27 

28 

18 

0 

14 

8 

30 

131 

Totals 

98 

833 

901 

443 

8 

289 

297 

131 

3000 

YI''. — Sisters  and  Sisters.    Symmetrical  System. 


Tint. 

1- 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

20 

19 

11 

4 

0 

2 

4 

2 

62 

2 

19 

294 

165 

81 

3 

67 

62 

22 

713 

3 

11 

165 

372 

118 

5 

78 

66 

13 

828 

4 

4 

81 

118 

132 

2 

56 

62 

7 

462 

5 

0 

3 

5 

2 

4 

5 

11 

6 

36 

6 

2 

67 

78 

56 

5 

104 

68 

15 

395 

7 

4 

62 

66 

62 

11 

68 

98 

26 

397 

8 

2 

22 

13 

7 

6 

15 

26 

16 

107 

Totals 

62 

713 

828 

462 

36 

395 

397 

107 

3000 
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VII. — Brothers  and  Sisters.    1500  Cases. 


Brothers. 


Tint. 

L 

2. 

3. 

4. 

5. 

6. 

7. 

1 

8. 

Totals. 

1 

5 

9 

18 

4 

0 

0 

1 

0 

37 

2 

20 

163 

101 

36 

0 

28 

19 

13 

380 

3 

9 

98 

193 

50 

0 

37 

17 

14 

418 

4 

5 

36 

49 

67 

3 

28 

13 

16 

217 

5 

0 

2 

.5 

1 

1 

2 

2 

3 

16 

6 

3 

47 

41 

27 

4 

42 

17 

14 

195 

7 

4 

34 

49 

22 

3 

27 

30 

19 

178 

8 

1 

10 

1 

0 

10 

8 

22 

59 

Totals 

47 

399 

463 

208 

11 

174 

107 

91 

1500 

Table  for  Assortative  Mating  in  Eye -colour. 
VIII. — Husbands  and  Wives.    774  Cases. 

Husbaiids. 


Tint. 

1 

1. 

2. 

3. 

4. 

5. 

6. 

7.  8. 

Totals. 

1 

2 

13 

4 

3 

0 

1 

2 

0 

25 

2 

1  6 

87 

42 

26 

0 

16 

13 

6 

196 

3 

\  .6 

56 

93 

31 

1 

16 

11 

6 

220 

4 

4 

32 

35 

18 

1 

15 

6 

1 

112 

5 

0 

0 

5 

1 

0 

0 

1 

0 

7  ■ 

6 

2 

38 

27 

10 

1 

12 

10 

1 

101 

7 

5 

20 

28 

7 

1 

6 

12 

4 

83 

8 

!  2 

8 

8 

2 

0 

2 

4 

4 

30 

Totals 

27 

254 

242 

98 

4 

68 

59 

22 

774 

Tables  for  the  Direct  Inheritance  of  Eye-colour.    Second  Generation. 
IX. — Paternal  Grandfather  and  Grandson,    765  Cases. 


Paternal  Gmndfather. 


First. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

4 

10 

3 

0 

0 

1 

3 

0 

21 

2 

7 

115 

31 

20 

1 

6 

13 

3 

196 

3 

5 

64 

109 

21 

0 

10 

22 

4 

235 

4 

2 

25 

40 

21 

0 

9 

13 

6 

116 

5 

0 

0 

0 

1 

0 

1 

0 

0 

2 

6 

0 

14 

32 

11 

0 

15 

5 

2 

79 

7 

4  ■ 

16 

16 

9 

1 

11 

21 

2 

80 

8 

0 

6 

5 

0 

0 

3 

16 

6 

36 

Totals 

22 

250 

236 

83 

2 

56 

93 

23 

765 
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X. — Paternal  Grandfather  and  Granddaughter.    681  Cases. 

Paternal  Grandfather. 


First. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

3 

6 

4 

5 

0 

1 

1 

0 

20 

2 

2 

94 

32 

10 

2 

6 

16 

4 

166 

3 

5 

67 

71 

17 

0 

9 

20 

3 

192 

4 

4 

35 

33 

26 

1 

10 

9 

3 

121 

5 

0 

3 

4 

1 

0 

1 

2 

1 

12 

6 

1 

16 

21 

11 

0 

15 

6 

4 

74 

7 

3 

10 

20 

11 

1 

8 

15 

3 

71 

8 

0 

2 

5 

1 

1 

1 

10 

5 

25 

Totals 

18 

233 

190 

82 

5 

51 

79 

23 

681 

I 


Table  XI. — Maternal  Grandfather  and  Grandson.    771  Cases. 

Maternal  Grandfather. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

3 

11 

3 

1 

1 

t 
1 

1  19 

2 

8 

113 

46 

22 

1 

13 

5 

3 

211 

3 

10 

87 

89 

12 

11 

8 

3 

220 

4  i 

5  ! 

4 

33 

35 

22 

15 

6 

2 

117 
2 

6 

2 

1 

25 

1 
25 

7 

14 

7 

4 

84 

7 

1 

22 

26 

6 

2 

9 

10 

4 

80 

8 

12 

12 

6 

4 

3 

1 

38 

Totals 

28 

304 

237 

76 

3 

66 

40 

17 

771 

6 


Table  XII. — Maternal  Grandfather  and  Granddaughter.    687  Cases. 

Maternal  Grandfather. 


Tint. 

1. 

i 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

1 

3 

7 

2 

1 

14 

2 

8 

84 

35 

11 

6 

2 

159 

3 

11 

67 

76 

18 

7 

15 

5 

199 

4 

7 

41 

40 

14 

15 

11 

5 

133 

5 

5 

2 

2 

1 

10 

6 

4 

21 

32 

1 

16 

5 

2 

81 

7 

4 

15 

14 

8 

1 

7 

19 

2 

70 

8 

5 

5 

1 

1 

5 

4 

21 

Totals 

35 

i 

241 

211 

55 

1 

61 

63 

20 

687 

I 

I 
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Table  XIII. — Paternal  Grandfather  and  Grandson.    741  Cases. 


Paternal  Grandmother. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

1 

2 

■  7 

1 

1 

2 

3 

17 

2 

6 

62 

69 

22 

4 

15 

25 

4 

207 

3 

4 

31 

95 

22 

1 

25 

33 

9 

220 

4 

3 

18 

36 

20 

4 

15 

16 

4 

116 

5 

1 

1 

1 

3 

6 

1 

10 

23 

6 

2 

16 

10 

4 

72 

7 

3 

15 

10 

4 

1 

14 

13 

11 

71 

8 

1 

10 

3 

3 

5 

5 

8 

35 

Totals 

19 

148 

243 

79 

13 

92 

106 

41 

741 

Table  XIV, — Paternal  Grandmother  and  Granddaughter.    717  Cases. 


Paternal  Grandmother. 


Tint. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

3 

3 

2 

6 

2 

16 

2 

7 

53 

56 

14 

2 

13 

28 

8 

181 

3 

8 

35 

65 

22 

6 

29 

28 

7 

200 

4 

4 

29 

36 

20 

3 

16 

23 

8 

139 

5 

1 

3 

1 

5 

1 

11 

6 

9 

29 

2 

4 

27 

8 

3 

82 

7 

10 

15 

12 

2 

10 

12 

7 

68 

8 

1 

2 

1 

2 

1 

4 

4 

5 

20 

Totals 

21 

141 

208 

75 

18 

104 

110 

40 

717 

Table  XV. — Maternal  Grandmother  and  Grandson.    756  Cases 


Maternal  Grandmother. 


Tint. 

j 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

1 

10 

1 

3 

3 

1 

19 

2 

10 

68 

53 

23 

24 

13 

13 

204 

3 

9 

39 

67 

38 

32 

23 

11 

219 

4 

3 

34 

19 

30 

19 

8 

4 

117 

5 

1 

1 

2 

6 

20 

11 

10 

24 

18 

1 

84 

7 

2 

9 

23 

11 

1 

17 

17 

80 

8 

4 

6 

5 

6 

3 

7 

31 

Totals 

25 

184 

181 

121 

1 

122 

85 

37 

756 
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Table  XVI. — Maternal  Grandmother  and  Granddaughter.    739  Cases. 


Maternal  Grandmother. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

2 

16 

18 

2 

7 

66 

34 

13 

•21 

15 

6 

162 

3 

12 

62 

55 

25 

1 

27 

23 

5 

210 

4 

6 

32 

36 

25 

23 

15 

7 

144 

5 

1 

3 

2 

3 

3 

12 

6 

1 

14 

21 

11 

27 

17 

2 

93 

7 

19 

17 

16 

17 

3 

76 

8 

5 

4 

7 

3 

3 

24 

Totals 

28 

212 

170 

85 

1 

124 

93 

26 

739 

Tables  for  the  Collateral  Inheritance  of  Eye-colour.    Second  Degree. 
XVII. — Paternal  Uncle  and  Nephew.    1290  Cases. 


Paternal  Uncle. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Total. 

1 

4 

10 

11 

6 

1 

4 

5 

2 

43 

2 

11 

136 

98 

40 

0 

26 

48 

12 

371 

3 

8 

84 

157 

26 

1 

27 

54 

7 

364 

4 

0 

29 

69 

36 

1 

19 

27 

12 

193 

5 

0 

2 

1 

0 

0 

2 

1 

0 

6 

6 

1 

31 

35 

7 

1 

30 

19 

3 

127 

,  7 

2 

21 

27 

24 

1 

13 

34 

11 

133 

8 

0 

11 

7 

6 

0 

10 

8 

11 

53- 

Total 

i 

26 

324 

405 

145 

5 

131 

196 

58 

1290 

XVIII. — Paternal  Uncle  and  Niece.    1128  Cases. 
Paternal  Uncle. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Total. 

1 

!  2 

10 

6 

2 

0 

1 

6 

2 

29 

2 

:  7 

85 

61 

27 

0 

29 

26 

13 

248 

3 

!  5 

82 

126 

29 

1 

26 

43 

7 

319 

4 

2 

47 

73 

40 

1 

29 

40 

5 

237 

5 

1 

8 

1 

4 

0 

1 

5 

4 

24 

6 

1 

26 

35 

12 

1 

8 

42 

3 

128 

7 

1 

20 

25 

19 

0 

22 

26 

7 

120 

8 

0 

4 

2 

3 

0 

5 

3 

6 

23 

Total 

26 

282 

329 

136 

3 

121 

191 

47 

1128 
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Table  XIX. — Maternal  Uncle  and  Nephew.    1242  Cases. 


Maternal  Uncle. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

8. 

Totals. 

1 

1 

8 

13 

3 

0 

3 

4 

1 

33 

2 

17 

137 

71 

29 

0 

19 

14 

9 

296 

3 

10 

128 

153 

26 

0 

29 

34 

3 

383 

4 

2 

50 

62 

28 

0 

22 

14 

1 

179 

5 

0 

1 

0 

0 

0 

0 

1 

0 

2 

6 

4 

33 

29 

12 

0 

35 

20 

3 

136 

7 

1 

33 

40 

11 

26 

27 

2 

140 

8 

0 

9 

17 

23 

8 

3 

13 

73 

Totals  1 

35 

399 

385 

132 

0 

142 

117 

32 

1242 

Table  XX. — Maternal  Uncle  and  Niece.    1434  Cases. 

Maternal  Uncle. 


Tint. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

2 

15 

9 

2 

0 

2 

1 

0 

31 

2 

6 

99 

76 

23 

0 

18 

13 

11 

246 

3 

18 

100 

108 

23 

0 

37 

36 

10 

332 

4 

4 

72 

64 

28 

0 

16 

21 

9 

214 

5 

0 

14 

2 

3 

0 

8 

5 

0 

32 

6 

5 

38 

41 

10 

0 

23 

11 

4 

132 

7 

1 

27 

25 

7 

0 

19 

14 

3 

96 

8 

0 

15 

5 

6 

0 

9 

11 

5 

51 

Totals 

36 

380 

330 

102 

0 

132 

112 

42 

1134 

Table  XXI. — Paternal  Aunt  and  Nephew.    1186  Cases. 


Paternal  Aunt. 


Tints. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

6 

13 

5 

3 

1 

4 

6 

4 

42 

2 

19 

113 

83 

45 

36 

29 

5 

330 

3 

10 

81 

147 

30 

29 

35 

8 

340 

4 

8 

28 

66 

38 

18 

22 

11 

191 

5 

0 

6 

3 

23 

35 

12 

1 

35 

10 

5 

124 

7 

5 

22 

28 

19 

18 

15 

5 

112 

8 

1 

4 

9 

8 

6 

4 

13 

47 

Totals 

52 

284 

373 

155 

2 

148 

121 

51 

1186 
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Table  XXII. — Paternal  Aunt  and  Niece.    1149  Cases. 


Paternal  Aunt. 


Tints. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

2 

11 

2 

3 

2 

11 

2 

33 

2 

15 

89 

62 

37 

2 

25 

40 

14 

284 

3 

12 

93 

119 

40 

3 

41 

26 

12 

346 

4 

10 

36 

62 

43 

5 

25 

21 

11 

213 

5 

5 

7 

1 

3 

16 

6 

1 

24 

33 

16 

1 

29 

19 

5 

128 

7 

2 

20 

28 

12 

2 

10 

22 

4 

100 

8 

7 

4 

5 

9 

4 

29 

Totals 

42 

285 

313 

155 

13 

138 

151 

52 

1149 

Table  XXIII. — Maternal  Aunt  and  Nephew,    1145  Cases. 


Maternal  Aunt. 


First. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

4 

8 

7 

3 

2 

3 

1 

28 

2 

5 

117 

81 

29 

43 

29 

6 

310 

3 

1 

73 

132 

38 

57 

43 

3 

347 

4 

1 

20 

54 

27 

1 

21 

11 

135 

5 

3 

2 

1 

6 

6 

24 

35 

22 

30 

23 

3 

137 

7 

26 

29 

20  . 

1 

26 

25 

8 

135 

8 

14 

5 

10 

12 

4 

2 

47 

Totals 

i 

11 

282 

346 

151 

2 

192 

138 

23 

1145 

Table  XXIV. — Maternal  Aunt  and  Niece.    1058  Cases. 


Maternal  Aunt. 


J^■i^st. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

Totals. 

1 

2 

3 

10 

15 

1 

5 

87 

86 

31 

23 

14 

12 

258 

3 

3 

71 

125 

32 

1 

49 

41 

3 

325 

4 

39 

51 

31 

1 

33 

19 

6 

180 

5 

4 

6 

1 

8 

5 

3 

27 

6 

1 

25 

47 

10 

24 

9 

2 

118 

7 

30 

29 

11 

14 

18 

10 

112 

8 

5 

4 

5 

2 

4 

3 

23 

Totals 

11 

264 

358 

121 

2 

153 

110 

39 

1058 
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[Notes  added  July  3,  1900. 

Note  I.  Inheritance  of  Temper  and  Artistic  Instinct.— In  additional  to  the  fraternal 
correlations  given  on  p.  102,  I  have  dealt  with  Mr.  Galton's  statistics  for  the  inheri- 
tance of  good  and  bad  temper  given  in  his  'Natural  Inheritance'  (p.  235).  The 
following  gives  the  distribution  of  good  and  bad  temper  among  1,294  brethren,  as 
deduced  by  Mr.  Yule. 


Fiist  Brother. 


Good  Temper. 

Bad  Temper. 

Totals. 

Good 
temper. 

330 

255 

585 

Bad 
temper. 

255 

454 

j 

709 

Totals 

585 

709 

:  1294 

The  correlation  is  "3167. 

A  like  table  is  that  for  artistic  instinct  in  the  direct  line  : — 


Parentage. 


Artistic. 

Non-artistic. 

Totals. 

o 

'-C 

cc 
+3 

296 

173 

469 

1^ 

<l 

i 

O  .2 

372 

666 

1038 

Totals 

668 

839 

1507 

In  this  case  the  correlation  is  '4039. 

The  fraternal  correlation  is  somewhat  low.  The  exact  significance  of  the  parental 
correlation  is  also  somewhat  vague,  as  the  parentage  is  classified  as  artistic  when 
one  or  hath  parents  are  artistic.  But  the  tw^o  tables  are  very  suggestive,  they 
indicate  how  the  new  method  will  enable  us  to  deal  quantitatively  even  with 
characters  like  temper  and  artistic  instinct  to  which  it  is  impossible  to  apply  directly 
a  quantitative  scale.  With  the  introduction  of  a  third  or  medium  class,  I  believe  it 
will  be  possible  to  obtain  excellent  results  for  heredity  from  very  simple  observations, 
and  I  have  in  hand  at  the  present  time  a  large  series  of  observations  on  collateral 
heredity  based  upon  such  simple  classifications.    The  reader  should  further  consult 

u  2 
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Mr.  G.  U.  Yule's  remarks  on  the  association  of  temper  and  of  artistic  instinct  in  his 
memoir  on  "  Association,"  '  Phil.  Trans.,'  A,  vol.  194,  p.  290,  1900. 

Note  II.  On  the  Correlation  of  Fertility  and  Eye-Colour. — In  the  course  of  the 
present  paper  I  have  frequently  referred  to  a  probable  influence  of  reproductive 
selection  as  the  source  of  the  progressive  change  in  eye-colour,  i.e.,  to  a  possibility  that 
eye-colour  is  correlated  with  fertility.  I  saw  from  Mr.  Galton's  tables  that  in 
many  cases  the  whole  family  had  not  been  recorded,  probably  the  eye-colour  of  the 
dead  or  of  absentees  being  unknown.  It  appeared  to  me  accordingly  that  it  would 
be  impossible  to  deal  directly  with  the  problem  of  fertility.  However,  it  has  since 
occurred  to  me  that  there  is  nothing  likely  to  give  the  missing  members  of  families  a 
bias  towards  one  rather  than  another  eye-colour,  and  that  we  may  simply  treat  them 
as  a  purely  random  subtraction  from  the  total  results.  Assuming  this,  Mr.  L.  N. 
FiLON,  M.A.,  has  prepared  for  me  tables  of  father's  and  mother's  eye-colour  and  of 
the  recorded  number  of  their  children.  From  these"^  I  take  first  the  following- 
results,  premising  (i.)  that  I  call  "  light  eyed,"  persons  with  eye-colours  1,  2  and  3,  and 
"  dark  eyed,"  persons  with  eye-colours  4,  5,  6,  7,  8,  i.e.,  drawing  the  line  between  light 
and  dark  grey;  (ii.)  that  I  take  as  small  families  those  with  0,  1,  2,  or  3,  recorded 
children  and  as  large  those  with  4  or  more  recorded  children. 


Father. 


Light  Eyed. 

Dark  Eyed. 

Totals. 

Small. 

,313 

Ul 

1 

454 

Large. 

264 

139 

403 

Totals 

577 

280 

857 

Mother 


Light  Eyed. 

Dark  Eyed. 

Totals. 

Small. 

253 

202 

455 

be 

225 

169 

394 

Totals 

478 

371 

849 

*  Correlation  taljles  were  prepared  of  the  size  of  families  0  to  15,  and  of  the  eye-colours  1  to  8,  but  it 
does  not  seem  needful  to  print  them  in  extenso. 
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We  have,  accordingly,  by  the  method  of  the  present  memoir  : — 
Correlation  of  size  of  family  with  darkness  of  eye-colour 

=       "0595,  for  fathers. 
=  —  -0239,  for  mothers. 

The  former  is  just  sensible,  the  latter  hardly  sensible  relative  to  the  probable  error. 
So  far  as  they  can  be  relied  upon,  they  would  denote  that  fathers  have  more  children 
the  darker  eyed  they  are,  and  mothers  more  children  the  lighter  eyed  they  are. 
This  is  in  accordance  with  the  result  given  in  the  memoir,  that  the  modern 
generation  is  darker  than  its  male  and  lighter  than  its  female  ancestry,  but  it  is  not 
the  explanation  given  in  the  text,  although  it  is  probably  the  true  one.  If  it  be  the 
true  one,  dark  fathers  and  light  mothers  ought  to  present  the  most  fertile  unions, 
and  it  seemed  desirable  to  test  this  directly.  We  have  already  seen  that  there 
exists  an  assortative  mating  in  eye-colour,  like  tending  to  mate  with  like,  the 
co-efficient  of  correlation  being  about  "1  ;  hence  if  we  were  to  correlate  the  eye-colour 
of  mothers  and  fathers,  i.e.,  husbands  and  wives  weighted  with  their  fertility,  we 
ought  to  find  this  result  substantially  reduced.    The  following  is  the  table  : — 


Fathers. 


Light  Eyed. 

Dark  Eyed. 

Totals. 

Light  Eyed. 

1183 

612 

1795 

? 

Dark  E; 

826 

455 

1281 

Totals 

2009 

1067 

3076 

We  find  r  =  -0239,  or  the  correlation  has  been  reduced  to  a  fifth  of  its  previous 
value,  and  is  now  of  the  order  of  its  probable  error.  To  mark  still  further  this 
increased  fertility  of  heterogamous  unions,  I  add  two  further  tables,  giving  the  mean 
number  of  recorded  ofispring  for  various  classifications  of  parental  eye-colour. 
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Fathers,  Fathers. 


Colours 

I— o. 

Colours 

^— O. 

Average 

OI  lOX/dlS. 

■ 

Colours 

Colours 

o— o. 

Average 
of  totals 

F-i  . 
f3  CO 

O  1 

'o 
O 

3-83 

4-57 

4-07 

CO 

Ui  . 

^  cq 
o  1 

'o 

Q 

3-19 

4-52 

3-86 

u 

O  1 

3-82 

3-73 

3-79 

s 

b  • 

S  00 
O  1 

o 

o 

3-98 

3-96 

3-97 

Average 
of  totals 

3-83 

4-17 

3-94 

Average 
of  totals 

3-68 

4-09 

3-94 

The  first  table  entirely  confirms  all  the  conclusions  reached, — dark  fathers  and 
light  mothers  are  most  fertile  absolutely  and  in  union.  The  second  table  shows 
that  it  is  the  blue-green  and  grey  rather  than  pure  blue-eyed  mothers  who  are  most 
fertile.  This  supplementary  investigation  accordingly  seems  to  support  the  view  of 
the  text  of  the  memoir,  namely  that  reproductive  selection  is  the  source  of  the 
secular  change  in  eye-colour  noted,  only  the  prepotent  fertility  which  is  replacing  the 
blue-  eyed  element  is  in  the  first  place  that  of  the  dark-eyed  male,  and  only  in  the 
second  place  due  to  mothers  having  eye-colours  dark  or  light  other  than  true  blue. 
We  seem  accordingly  in  eye-colour  to  have  reproductive  selection  working  through 
heterogamy  rather  than  through  homogamy  as  in  the  case  of  stature.*  The  effect, 
however,  is  like, — -the  progressive  elimination  of  one  type  of  character.] 

*  See  '  Roy.  Soc.  Proc.,'  vol.  66,  p.  30,  and  vol.  66,  p.  316  et  seq. 
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I.  Introductory. 

(1.)  The  present  paper  endeavours  to  deal  with  a  problem  ujoon  which  I  have  long 
been  occupied,  adopting  the  widest  basis  compatible  with  the  time  and  means  at  my 
disposal.  In  the  first  place,  I  have  often  been  impressed  with  the  small  reduction  in 
variability  which  can  be  produced  by  selection.  The  offspring  of  a  single  parent 
while  diverging  in  character,  possibly  very  widely  from  the  average  character  of  the 
race,  will  still  have  a  variability  in  that  character  only  slightly  reduced,  say  at  most 
10  per  cent,  below  the  racial  variability.  Even  if  we  select  the  ancestry  for  an 
indefinite  number  of  generations,  the  offspring  will  have  a  variability  upwards  of  89 
})er  cent,  of  that  of  the  original  race."*  Now  this  capacity  in  the  parent  for  pro- 
ducing variable  offspring  must  be  in  some  manner  related  to  the  degree  of  resemblance 
in  those  offspring.  We  have  thus  the  two  fundamental  divisions  of  our  subject : 
(i.)  What  is  the  ratio  of  individual  to  racial  variability  ? 

(ii.)  How  is  the  variability  in  the  individual  related  to  inheritance  within  the 
race  ? 

I  must  endeavour  to  explain  my  meaning  a  little  more  fully  and  clearly.  The 
individual  puts  forth  a  number  of  like  organs,  corpuscles  in  the  blood,  petals  of  the 
flower,  leaves  of  the  trees,  scales  on  the  wing.  These  may  or  may  not  be  divided  up 
into  differentiated  groups.  Special  forms  of  leaves  occur  in  the  neighbourhood  of  the 
fruit ;  florets  may  be  differentiated  according  to  their  position  on  the  flower,  scales 


*  "  On  the  Law  of  Ancestral  Heredity,"  '  Roy.  Soc.  Proc.,'  vol.  62,  p.  400. 
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according  to  their  position  on  the  wing ;  there  may  be  two  or  more  classes  of  blood- 
corpuscles.  But  if  we  take  organs  which  are  like,  and  so  far  as  we  can  judge 
undifferentiated,  there  is  still  a  diiference  between  them.  The  individual  in  putting 
forth  what  I  may  perhaps  term  "  undifferentiated  like  organs,"*  does  so  with  a  certain 
measure  of  variability.  I  am  not  concerned  at  present  with  the  source  of  this 
variability.  It  may  be  due  to  the  individual  environment,  to  the  physique  of  the 
individual,  or  to  some  inherent  or  bathmic  tendency  in  the  individual  due  to  its 
ancestral  history.  All  I  am  concerned  with  at  present  is,  that  the  undifferentiated 
like  organs  of  an  individual  possess  a  certain  variability,  and  that  this  variability  is 
somewhat  less  than  that  of  all  like  organs  in  the  race.  If,  however,  the  variability 
Avere  entirely  due  to  external  circumstances  attending  growth,  we  should  hai-dly 
expect  to  find  leaves  gathered  from  different  branches  and  on  different  sides  of  a  tree 
more  alike  to  each  other  than  to  leaves  of  other  trees  of  the  same  race.  But  this  is 
indeed  the  case  ;  there  is  a  considerable  correlation  among  the  undifferentiated  like 
organs  of  an  individual  notwithstanding  the  small  reduction  on  racial  variability. 
Among  the  trees  and  plants  considered  in  this  paper  we  have  data  enough  to  enable 
us  to  determine  whether,  say,  a  hundred  leaves  placed  before  us  were  gathered  from 
a  single  tree,  or  collected  at  random  from  a  number  of  trees.  Here,  of  course,  I  mean 
by  to  "  determine"  to  state  the  odds  for  or  against  these  two  assumptions. 

Just  as  we  can  find  by  the  methods  already  discussed  in  earlier  memoirs  of  this 
series,  the  degree  of  correlation  between  brothers  and  the  variability  of  an  array  of 
brothers  due  to  the  same  parentage,  so  we  can  determine  the  correlation,  i.e.,  the 
degree  of  resemblance  between  the  undifferentiated  like  organs  of  the  individual  and 
the  degree  of  variability  within  the  individual.  This  determination  is  the  answer  to 
our  first  fundamental  jDroblem,  that  of  the  ratio  of  individual  to  racial  variability. 
But  turning  to  the  process  of  reproduction,  the  offspring  depend  upon  tlie  jDarental 
germs,  and  it  would  thus  seem  that  the  degree  of  resemblance  between  offspring 

*  [I  am  fully  conscious  of  rcrhal  difficulty  in  the  phrase  "  unditfei'entiatefl  like  organs,"  but  I  believe 
that  the  distinction  between  the  differentiated  and  the  undifferentiated  is  quite  clear  either  from  the 
standpoint  of  observation  or  from  that  of  the  frequency  distribution  itself.  Differentiation,  ^vhether  due 
to  function,  position  on  the  individual,  season  of  production,  &c.,  is  usually  connected  with  one  or  two 
well-marked  dominating  factors  ;  it  is  statistically  discoverable  by  testing  the  frequency  disti'ibution  for 
heterogeneity.  On  the  other  hand,  variability  in  "  luidifferentiated  like  organs  "  is  not  to  be  associated 
with  any  one  or  two  dominating  factors  ^^■hich  can  ))e  isolated  ;  it  is  due  to  that  cora1>ination  of  many 
small  causes,  inherent  and  environmental,  which  leads  to  what  is  familiar  in  1)oth  theory  and  observation 
us  a  homogeneous  chance  distribution.  A  diversity  due  to  ditlerentiation  and  a  variability  due  to  chance 
are  quite  distinct  things.  The  one  is  the  result  of  dominating  factors  which  can  be  isolated  and 
described ;  the  other  of  a  great  number  of  small  factors,  varying  from  organ  to  organ,  and  incapable  of 
being  defined  or  specified.  Indeed,  upon  each  dominating  factor  of  difierentiation  is  superposed  such  a 
chance  variability.  Of  course  all  things  which  differ  even  by  chance  variation  are  in  a  certain  sense 
<lilferentiated.  But  the  term  differentiation  is  throughout  this  paper  reserved  for  the  differences  which 
urise,  not  from  a  multiplicity  of  small  causes,  but  from  dominating  and  usually  easily  recognisable  single; 
influences. — Tiihi,  1901.] 
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must  depend  on  the  variability  of  the  sperm  cells  and  the  ova  which  may  each  be  fairly 
considered  as  "  undifferentiated  like  organs."  Here  again  we  are  not  compelled  to 
assert  that  much  or  little  is  due  to  environment  and  little  or  much  is  due  to  inherent . 
ancestral  influence.  All  we  assiune  is  that  such  causes  as  produce  the  likeness 
between  leaves  of  the  same  tree,  or  florets  on  the  same  flower,  produce  the  likeness 
between  spermatozoa  or  ova  of  the  same  individual,  and  that  on  this  likeness  th& 
ultimate  resemblance  of  offspring  from  the  same  parent  depends.  We  have  then  ta 
investigate  how  the  quantitative  resemblance  between  offspring  of  the  same  parents 
is  related  to  the  quantitative  resemblance  between  the  undifferentiated  like  organs  in 
the  individual ;  and  then  we  must  test  on  as  wide  a  range  of  data  as  possible  this 
theoretical  relationship. 

Now  the  reader  will  perceive  at  once  that  if  we  can  throw  back  the  resemblance  of 
offspring  of  the  same  parents  upon  the  resemblance  between  the  undifferentiated  like 
organs  of  the  individual,  we  shall  have  largely  simplified  the  whole  problem  of" 
inheritance.  Inheritance  will  not  be  a  peculiar  feature  of  the  reproductive  cells. 
One  frog,  let  us  say,  differs  from  another  in  that  it  produces  blood  corpuscles  more  or 
less  alike  and  unlike  those  of  another  frog.  In  the  simplest  forms  of  reproduction, 
budding  and  parthenogenesis,  the  offspring  will  not  be  absolutely  alike,  for  buds  and 
ova  are  undifferentiated  like  organs,  and  such  organs  hav^e  only  a  limited  degree  of 
resemblance.  If  this  view  be  correct,  variability  is  not  a  peculiarity  of  sexual  repro- 
duction, it  is  something  peculiar  to  the  production  of  undifferentiated  like  organs  in 
tlie  individual,  and  the  problems  of  heredity  must  largely  turn  on  how  the  resem- 
blance between  such  organs  is  modified,  if  modified  at  all,  by  the  conditions  of  nurture, 
growth,  and  environment  generally.  Our  discussion  of  the  subject  will  naturally  break 
up  into  the  following  divisions  : 

(a.)  On  the  theoretical  relationship  between  the  correlation  of  offspring  and  the- 
correlation  of  vindifi^erentiated  like  organs  in  the  individual. 

(h.)  A  determination  of  the  correlation  of  undifferentiated  like  organs  for  as  wide  a 
range  of  life  as  possible. 

The  present  paper  deals  only  with  variation  and  correlation  in  the  vegetable  king- 
dom, but  the  data  for  variation  and  correlation  in  the  animal  kingdom  are  being 
collected  at  the  same  time. 

(c.)  A  comparison  of  the  degree  to  which  the  results  obtained  from  (6)  satisfies  tha 
observed  values  of  fraternal  correlation  already  found  when  we  use  the  relation 
determined  in  («). 

II.  Theoretical  Discussion  of  the  Eelation  of  Fraternal  Correlation  to 
THE  Correlation  of  Undifferentiated  like  Organs. 

(2.)  Let  z  be  the  deviation  from  the  mean  of  the  general  population  of  any  indi- 
vidual with  regard  to  any  character.    Let  us  suppose  z  to  depend  upon  certain 
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characters  in  the  spermatozoon  and  certain  characters  in  the  ovum  from  which  the 
individual  has  developed.  These  characters  cannot  of  course  be  determined,  still  less 
measured,  but  we  have  no  reason  to  doubt  their  existence.  In  the  paa-ticular  sper- 
matozoon from  which  the  individual  has  developed,  let  them  have  deviations  x^,  x.^, 
.  .  .  from  their  mean  values  for  all  the  spermatozoa  of  the  race,  and  let  t/j,  ^/o,  3/3  ..  . 
be  the  corresponding  deviations  for  the  ovum  characters.  Then 

2  X,,  .     .     .     2/1,   Vl^y-i     '     ■     '  ) 

where  /  is  a  quite  unknown  function. 

The  mean  of  the  z-character  will,  however,  correspond  to  the  mean  values  of  the 
spermatozoon  and  ovum  characters,  and  if  we  suppose  the  variation  of  these 
characters  small  as  compared  with  their  mean  value,  we  assume  as  usual  for  such 
deviations  : 

z  =  a^x^  +  ao.ro  -h  a.^x^  +  .  .  .  +  ^^y^  +  ^oT/o  +  /S.^j/-^  +  .     .     .  (i.), 

where  the  as  and  ^'s  are  independent  of  the  xs  and  ?/'s,  and  define  the  male  and 
female  inheritance. 

Now  let  cr  be  the  standard  deviation  of  the  character  z  in  the  population ;  cr^  the 
standard  deviation  of  x.^,  cr'^  of  ijr^.  Let  r^.^  be  the  correlation  of  Xj,  and  x^^,  r'p^  of  ijp 
and  yq.  Then  we  will  suppose  that  there  is  no  selection  of  particular  ova  by  parti- 
cular spermatozoa,  or  that  Xp  and  y,^  are  not  correlated.  Then  if  n  =  number  of 
individuals  in  the  jjopulation  : 


cr  — 

n       '        '■  '■'  "  n 


where  S  is  the  sum  for  all  individuals  of  any  x-  or  y  for  constant  subscript,  and  S  is 
the  sum  of  a  and  yS  for  every  possible  subscript.  This  folloAvs  by  simple  squaring  and 
remembering  that  S  {.tp  y,^  —  0.    We  thus  reach  : 

Now  let  us  considei'  the  correlation  of  two  individuals  due  to  the  spermatozoa  and 
ova  put  forth  by  the  same  two  individuals.  Let  and  2^  be  the  values  of  their 
characters,  and  x' ,  x",  y',  y"  represent  the  fundamental  characters  in  the  two  sper- 
matozoa, and  two  ova  on  which  they  depend. 

Then  we  have 

2i  =  S  {ctpx'^)  +  S  {IBi,y'j) 
23  =  S  {(^px"p)  +  t  {/3py"p) . 

Now  let  us  multiply  2j  by  %  and  sum  for  every  fraternal  pair ;  then  if  E,  be  the 
VOL.  cxcvii. — A.  2  p 
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fraternal  correlation  we  have,  since  one  set  of  brethren  will  have  the  same  standard 
deviation  as  the  other,  if  they  be  taken  of  the  same  sex, 

S      X  2o)  =  ?i  X  o-  X  o-  X  R, 

where  n  is  the  total  number  of  pairs  of  brethren. 
Hence 

n  X  a'  X  R  =  t{a;{x',x%))  +  %  {aj,a,^^  {x' ,x" ,  +  x" ,x\)) 

+  2  (ArS  {y',y%))  +  t  {13^^  {y'.y",  +  y'>;)) 

with  the  same  notation  as  before  and  the  same  assumption  as  to  no  selective  process 
between  the  female  and  male  reproductive  cells. 

Now  x'p,  x' p  ai-e  undifferentiated  like  organs  put  forth  by  an  individual.  If  p^^  be 
the  correlation  of  such  : 

Similarly  S  (y'py'p)  =  na'/p'^^, . 

In  the  next  place  we  should  have  all  pairs  of  brothers  : 

&>{x'pX"r^)    =    &{x''/J0'j)    =    (Xj,(T,ippr,  , 

S  {y'py"'i)  =  S  {y%y\j)     o->'.,pV/  • 

We  thus  deduce 

cr-  X  R  =  S  {^fo-fppj)  +  t  {^p~a'./p'pp) 

+  2t{cc^,arja;^,(rrjppri)+2t{^p^rjO-'p(r'r/p'j^rj).    ....  (iii.). 

Now  if  the  degree  of  resemblance  between  undifferentiated  like  organs  vary  largely 
from  organ  to  organ,  we  could  proceed  no  further  with  this  investigation.  We  should 
have  shown  that  there  was  a  relation,  namely  (iii.)  between  the  II  and  the  ps,  but 
as  the  characters  in  the  spermatozoa  and  ova  are  unknown  and  unmeasurable,  we 
could  not  proceed  further.  On  the  other  hand,  if  the  correlation  between  undif- 
ferentiated like  organs  is  approximately  constant,  then  we  may  replace  correlation 
coefficients  like  ppp  and  p'p^  by  their  mean  value  p,  say. 

We  have  next  to  consider  such  correlations  as  pp^  or  p'p,^.  These  are  the  two 
correlations  between  two  different  characters  in  two  spermatozoa  or  ova  from  the 
same  individual.  Such  correlations  must  vanish  (i.)  if  there  be  no  organic  correlation 
between  characters  of  the  same  spermatozoon  or  ovum,  (ii. )  if  there  be  no  correlation 
between  the  reproductive  cells  put  forth  by  the  same  individual.  In  other  words,  pp^ 
must  contain  p  and  Vj,^  as  factors,  and  since  if  these  two  correlations  are  perfect  pp^j 
must  be  perfect,  it  is  a  reasonable  hypothesis  to  assume 

pprj  =  p  X  r^rj  (iv.). 

This,  it  will  })e  seen,  is  identical  in  form  with  the  result  I  have  supposed  to  hold 
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for  cross-heredity,'''  and  which  appears  to  be  approximately  true  for  that  case.  We 
shall  consider  later  statistics  bearing  on  this  result. 

Assuming  accordingly  that  (iv.)  holds,  we  find  from  (iii.) 

=:  p  X  a"  by  (ii.). 
Hence  we  conclude  that 

R  =  />  .    •    •  (v.)- 

Or  :  The  correlation  hetiveen  brothers  will  he  equal  to  the  mean  correlation  hetiveen 
the  undifferentiated  like  organs  put  forth  hy  an  individual. 

Now,  if  this  result  be  true,  it  is  very  remarkable  and  very  fundamental.  We 
should  hardly  expect  it  to  be  absolutely  true,  for  it  is  very  unlikely  that  the  coefficient 
of  correlation  between  undifferentiated  like  organs  is  the  same  whatever  the  organs 
may  be.  This  equality  may  rather  be  spoken  of  as  belonging  to  an  ideal  theoretical 
vital  state  approaching  the  actual  state,  perhaps,  as  Boyle's  Law  or  the  perfect  fluid 
approach  phenomena  observed  in  physical  nature.  What  we  should  expect  would 
be  a  general  approximation  between  the  values  of  R  and  p,  and  a  tendency  to  equality 
when  large  series  are  averaged.  This  is  the  point  which  we  shall  investigate  in  the 
sixth  section  of  this  paper,  after  placing  before  the  reader  in  the  fourth  section  a  fairly 
Avidet  range  of  actual  statistics. 

It  will  be  seen  at  once  that  if  (v.)  represents  an  approximate  or  average  truth  of 
living  nature,  then  the  problem  of  inheritance  is  to  a  large  extent  the  same  as  the 
jjroblem  of  variability  in  the  individual.  We  have  not  answered,  of  course,  even  in 
part,  the  problem  of  why  two  brothers  resemble  each  other,  but  we  see  that  it  is 
part  of  a  much  more  general  problem  having  nothing  to  do  with  sexual  reproduction. 
It  is  one  with  the  jDroblem  of  the  likeness  in  leaves  of  the  same  tree,  or  the  likeness 
in  scales  on  the  same  spot  of  a  moth's  wing.  It  brings  the  problem  of  heredity  into 
closer  touch  with  the  problem  of  variability.  When  we  ascertain  the  sources  of 
variation  in  the  individual,  then  we  shall  have  light  on  the  problem  of  fraternal 
resemblance. 

III.  On  the  Variability  and  Correlation  of  Undifferentiated  like  Organs 

IN  the  Individual. 

(3.)  I  must  frankly  admit  that  I  have  collected  my  material  from  the  standpoint  of 
the  mathematical  statistician  and  not  of  the  trained  field  naturalist.  I  have  sought 
things  which  were  easy  to  count  and  measure,  and  endeavoured  to  avoid  "  difierentia- 

*  "  The  Law  of  Ancestral  Heredity,"  '  Roj\  Soc.  Proc.,'  vol.  62,  p.  411. 

t  Not  wide  when  we  regard  the  natural  range  of  living  forms,  but  from  the  standpoint  of  the  laboui- 
that  has  been  spent  on  the  collection. 

2  p  2 
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tion  "  only  l)y  the  roughest  tests  and  by  largely  untrained  powers  of  observation.  I 
have  examined  my  material  to  see  if  it  gave  obvious  signs  of  heterogeneity  in  the 
tabulated  bulk.  I  have  examined  my  individuals  and  their  organs  for  obvious  out- 
ward signs  of  differentiation.  But  I  have  not  studied  the  morphological  evolution 
of  the  organs  considered,  or  questioned  whether  the  parts  counted  were  all  due  to  the 
same  source.  Hence  it  is  quite  possible  that  the  botanist  may  reject  at  once  some  of 
my  series.  What  I  have  endeavoured  to  do  is  to  take  as  wide  a  range  of  as  different 
organs  as  possible  in  different  types  of  life  and  trust  to  the  bulk  of  my  statistics  to 
give  me  a  substantially  accurate  average  value  of  p  to  compare  with  the  values  of  E. 
we  have  determined  on  other  occasions.  At  the  same  time  the  material  here  pre- 
sented does  not  by  any  means  exhibit  all  the  trials  made  ;  we  often  enough  at  the  first 
attemjDt  did  not  get  a  suitable  character  to  measure  or  count,  or  again  the  individuals 
collected  were  occasionally  insufficient.  Nothing,  however,  has  been  omitted  which 
was  unfavourable  to  the  conclusions  ultimately  drawn.  Thus  the  Nigella  Hispanica 
statistics  have  been  retained,  although  there  is  not  the  slightest  doubt  of  a  very 
considerable  differentiation  in  the  flowers  growing  at  different  parts  of  the  plant. 

When  such  a  differentiation  takes  place  the  result  will  generally  be  a  great  reduc- 
tion in  the  correlation;  for  "like"  organs  on  the  same  individual,  say  A  and  a  if 
differentiated,  Avill  really  be  less  closely  related  to  each  other  than  to  B  and  h  respec- 
tively, the  corresponding  organs  in  a  second  individual.  On  the  other  hand  a 
heterogeneity  of  material,  say  a  mixture  of  two  different  local  races,  will  tend  as  a 
rule  to  raise  the  correlation,"^  for  it  generally  amounts  to  compounding  two  very 
like  correlation  surfaces  with  the  mean  of  one  approximately  shifted  along  the 
regression  line  of  the  second.  If  we  bear  these  two  opposing  influences  in  mind, 
differentiation  tending  to  reduce,  heterogeneity  to  increase  the  actual  degree  of  corre- 
lation of  undifferentiated  like  organs,  it  will  not  seem  incompatible  with  an  actual 
approximate  constancy  of  such  correlation  to  find  a  fairly  wide  range  of  values  in  our 
statistics.  We  set  them  forth  as  the  first  rough  attempt  to  appreciate  the  resem- 
blance of  like  jDarts  within  the  individual.  To  the  specialist  in  the  future  must  be 
left  the  work  of  selecting,  with  special  knowledge,  truly  homogeneous  material  and 
absolutely  undifferentiated  characters,  and  thus  obtaining  the  required  correlation  to 
a  much  higher  degree  of  accuracy. 

(4.)  It  remains  to  explain  the  process  by  which  the  correlation  was  deduced.  Let 
us  take  as  an  illustration  beech-leaves.  One  hundred  trees  fairly  of  the  same  age  and 
belonging  to  the  same  district,  were  selected,  and  twenty-six  leaves  specified  by  the 
letters  of  the  alphabet  were  gathered  from  each  of  these.  The  leaves  were  gathered 
so  far  as  possible  all  round  the  outside  of  the  tree,  roughly  about  the  same  height  from 
the  ground,  and  scattered  over  different  parts  of  the  individual  boughs.  Thus  each 
tree  was  supjDosed  to  be  individualised  by  twenty-six  leaves.  The  veins  on  these  leaves 
were  then  counted,  and  varied  for  beech-leaves  in  general  between  ten  and  twenty-two, 

*  Sea  'Phil.  Trans.,'  A,  vol.  192,  p.  277. 
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All  the  possible  pairs  were  now  taken,  i.e.,  ^  {26  X  25)  =  325  in  number,  and  entered 
on  a  correlation  table  in  the  usual  manner,  the  two  variables  being  the  number  of 
veins  in  the  first  leaf  and  the  number  of  veins  in  the  second  leaf  But  as  either 
member  of  the  pair  might  be  a  "  first "  leaf,  the  table  so  formed  was  rendered  sym- 
metrical by  starting  with  either  leaf  in  the  pair  as  first  or  second.  Thus  a  single  tree 
led  to  650  entries  in  the  correlation  table,  or  with  100  trees  there  were  65,000  entries. 
This  large  number  must  not  lead  the  reader  to  overweight  the  importajice  of  the 
constants  calculated  upon  it.  There  were  only  100  trees  leading  to  32,500  pairs  of 
leaves,  each  pair  coming  from  the  same  tree.  I  should  have  much  preferred  a 
thousand  trees,  but  the  great  labour  of  collecting,  counting,  and  calculating  precluded 
any  such  number.  In  many  cases  also  it  would  have  been  practically  impossible  to 
have  obtained  100  individuals  growing  under  fairly  like  environment.  For  example, 
I  only  succeeded  in  approximating  to  100  Spanish  chestnuts  from  one  district  and 
near  one  age.  When  I  took  a  second  hundred,  gathered  from  several  districts,  partly 
old  trees  and  partly  others  of  a  pollard  growth  of  fifteen  to  twenty  years,  I  found  the 
heterogeneity  at  once  increased  the  correlation  (see  below,  p.  301).  In  many  cases,  of 
course,  it  was  impossible  to  obtain  twenty-six  undifferentiated  like  organs  from  the 
same  individual.  In  these  cases  the  pairs  were  formed  in  the  same  manner,  but  in 
some  series  the  total  number  due  to  each  individual  varied  very  considerably,  and 
accordingly  the  work  of  verifying  the  tables  was  much  increased.  When  but  few 
pairs  could  be  obtained  from  each  individual,  we  have  sometimes  increased  the 
number  of  individuals  dealt  with  up  to  a  couple  of  hundred.  But  the  labour 
of  dealing  even  with  a  hundred  individuals  is  often — for  example,  in  the  case  of 
mushrooms  and  onions — very  serious.  Had  it  not  been  for  the  generous  help  of  a 
number  not  only  of  willing  but  of  competent  collectors  and  calculators,  the  material 
here  dealt  with  would  have  taken  not  eighteen  months  but  years  of  my  own 
unaided  efforts. 

The  calculation  of  the  means,  standard  deviations,  probable  errors,  and  correlations 
of  each  table  was  carried  out  in  the  manner  sufficiently  discussed  in  earlier  papers  of 
this  series.*  The  symmetry  of  the  tables  leads  to  slight  simplifications  in  calculating 
the  product  moment  which  will  readily  suggest  themselves  to  the  reader,  and  of 
course  only  a  single  mean  and  standard  deviation  is  required  for  each  table.  Tests 
for  the  accuracy  of  the  last  two  constants  are  at  once  provided  in  the  case  in  which 
the  same  number  of  organs  are  taken  from  each  individual,  for  their  values  must  be 
identical  with  those  obtained  for  the  whole  series  of  organs  entered  only  once  and  not 
for  each  possible  jjair. 

(5.)  The  quantitative  measurement  of  the  degree  of  resemblance  between  undif- 
ferentiated-like  organs  being,  so  far  as  I  am  aware,  a  quite  novel  branch  of  investiga- 
tion, I  venture,  with  some  hesitation,  to  introduce  certain  terms  to  describe  oft- 

*  See  especially  Memoir  III.,  'Phil.  Trans.,'  A,  vol.  1S7,  pp.  2.5:3-318,  and  Memoii'  IV.,  'Phil.  Trans.,' 
A,  vol.  19J,  pp.  229-311. 
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recurring  ideas.  I  shall  call  undifferentiated  like  organs  homotyj^es,  indicating  that 
they  are  types  produced  by  the  same  mould  or  individual.  Thus  two  leaves  of  the 
same  tree,  or  two  blood-corpuscles  from  the  same  frog  are  homotypes  ;  their  resem- 
blance will  be  homotypic,  and  the  character  by  which  their  resemblance  is  quantita- 
tively measured  will  be  the  homotypic  character.  I  shall  speak  of  homotypic 
correlation  and  distinguish  it  from  organic  correlation — although,  of  course,  the 
former  is  in  a  certain  sense  organic.  By  organic  correlation  I  refer  to  the  correlation 
between  two  different  characters  in  the  same  organ  ;  by  homotypic  correlation  to 
the  correlation  between  the  same  or  different  characters  in  a  pair  of  homotypes. 
If  the  same  character,  then  the  correlation  is  direct  homotypic  correlation  ;  if  different 
characters,  then  the  correlation  is  cross  homotypic  correlation.  Lastly,  the  principle 
that  homotypes  are  correlated,  i.e.,  that  variation  within  the  individual  is  less  than 
that  of  the  race,  or  that  undifferentiated  like  organs  have  a  certain  degree  of 
resemblance,  I  shall  speak  of  as  homotyposis.  Thus  homotyposis  denotes  not  only 
likeness  of  the  homotypes,  but  that  this  likeness  has  probably  definite  quantitative 
limits.  If  my  view  be  correct,  heredity  is  only  a  special  case  of  homotyposis,  any 
multiplication  of  cells  is  homotypical,  and  denotes  a  given  degree  of  variation  and  a 
given  degree  of  likeness.  This  does  not  "  explain"  heredity,  but  shows  it  merely  as 
a  phase  of  a  much  wider  natural  process. 

IV.  Actual  Data. 
Section  I. — Leaves  of  Trees. 

(6.)  The  leaves  of  trees  provide  material  for  fairly  easy  computation  without 
measurement.  Thus  we  may  count  the  veins  on  the  leaf  or  the  leaflets  on  the  com- 
pound leaf,  and  ascertain  the  degree  of  resemblance  between  leaves  of  the  same  tree. 
The  variation  is,  however,  in  some  cases  very  considerable,  and  the  labour  of  forming 
the  tables  involving  thousands  of  entries  very  great.  Still  the  leaf-series  are  some 
of  my  best,  and  considered  as  a  whole,  perhaps  the  most  satisfactory. 

(7.)  A.  Ash  (Fraxinus  excelsior).  Numher  of  Pinnte  on  Leaf. — My  first  series  (i.) 
consists  of  26  leaves  taken  from  each  of  109  trees  by  Dr.  Alice  Lee  or  myself 
The  trees  were  nearly  all  large  old  trees,  growing  on  the  commons  of  Great  and 
Little  Hampden,  Buckinghamshire.  The  leaves  were  taken  as  far  as  possible  from  a 
variety  of  branches  all  round  the  tree,  and  from  different  points  on  these  branches. 
The  labour  of  tabling  and  deducing  the  constants  is  due  to  Dr.  Lee.  The  number 
of  pairs  =  70,850. 

My  second  series  (ii.)  consist  of  26  leaves  taken  from  each  of  120  trees  by  Miss 
C  D.  Fawcett,  B.Sc.  These  trees,  many  of  which  were  young  trees  growing  in 
the  hedgerows,  were  taken  in  the  neighbourhood  of  Lyme  Begis  in  Dorsetshire. 
The  same  rules  as  to  gathering  were  observed.  Dr.  Lee  again  undertook  the  labour 
of  tabling  and  the  deduction  of  the  constants.    The  number  of  pairs  =  78,000. 
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My  third  series  consists  of  26  leaves  from  each  of  100  trees  in  Monmouthshire, 
yoimg  trees  being  included,  but  the  same  rules  as  to  gathering  being  observed.  The 
whole  of  the  labour  of  gathering,  counting  and  deducing  the  constants  is  due  to 
Miss  Mildred  E.  Barwell,  of  Girton  College.* 

The  general  numerical  results  are  given  in  the  table  below  : — 


Resemblance  of  Ash  Leaves  from  same  Tree. 


Series. 

Number. 

Mean  No.  of 
leaflets. 

S.  D.  of 

leaflets. 

Correlation. 

Trees. 

Leaves. 

Pairs. 

Buckinghamshire 
Dorsetshire  . 
jMonmouthshire  . 

109 
120 
100 

2834 
3120 
2600 

708.50 
78000 
65000 

10-1295  ±  -0214 
9-7260  ±  -0239 
9-8766  ±  -0265 

1-6891  +  -0151 

1-  9759  +  -0169 

2-  0058  ±  -0188 

-3743 
-3964 
-4047 

"+  -0109^ 
■+  -0102^ 

+  -our 

t 

Mean  of  series  . 

9-9107 

1-8903 

-3918 

Thus  a  total  of  329  trees,  giving  8,554  compound  leaves  counted  and  yielding 
213,850  pairs  on  the  symmetrical  tables,  were  dealt  with  in  all. 

It  will  be  seen  at  once  that  the  mean  number  of  pinnce  to  the  leaf,  and  their 
variability  differs  considerably  with  the  locality,  the  age  and  the  environment  of  the 
tree,  but  the  correlation  of  leaves  from  the  same  tree  is  fairly  alike  for  the  three 
series,  and  almost  identical  with  the  theoretical  value  '4000  given  by  the  Law 
of  Ancestral  Heredity  for  the  degree  of  resemblance  between  pairs  of  brothers. 

The  following  is  the  actual  distribution  of  the  frequency  of  the  leaflets  : — - 


Number  of  Pinnae  on  Leaves, 


3. 



4. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

Total. 

Buckingh'shire 

3 

0 

16 

21 

201 

67 

879 

156 

1140 

68 

257 

9 

17 

0 

2834 

Dorsetshire.  . 

\ 

0 

84 

30 

396 

115 

959 

228 

911 

72 

280 

11 

29 

1 

3120 

JMonmouthshire 

5 

42 

24 

279 

55 

836 

143 

896 

83 

216 

6 

13 

1 

2600 

Total    .  . 

8 

5 

142 

75 

876 

237 

2674 

527 

2947 

223 

753 

26 

59 

2 

8554 

*  This  was  a  remarkably  satisfactory  piece  of  work,  carried  through  from  the  collecting  of  the  raw 
material  down  to  the  calculation  of  the  constants,  the  statistical  methods  having  to  be  mastered  and 
applied  during  the  cou'  se  of  the  investigation. 

t  In  determining  the  probable  errors  of  the  correlation  coefficients,  there  is  some  c^uestion  as  to  whether 
we  should  use  the  number  of  individuals,  which  seems  too  small,  the  number  of  pairs,  which  seems  too  large, 
or  the  number  of  individual  organs  dealt  with.  I  have  used  the  latter,  but  enclose  the  probable  errors  in 
brackets  to  mark  the  doubt. 
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It  will  be  remarked  at  once  that  the  leaves  with  an  even  number  of  leaflets  are 
relatively  infrequent ;  there  is  usually  a  single  leaflet  at  the  end  of  the  compound 
leaf.  I  was  able  at  Hampden,  however,  to  obtain  leaves  illustrating  almost  the 
whole  evolution  of  this  single  leaflet.  Sometimes  this  leaflet  had  the  slightest 
division  at  its  point ;  this  division  was  found  in  most  stages  of  double  tongued 
leaflet  down  to  a  comjilete  double  leaf,  which  would  be  reckoned  as  two  pinncc.  In 
the  same  way  the  centre  leaflet  was  found  slightly  trisected,  three  tongued  and 
finally  a  triple  leaflet  scarcely  distinguishable  from  the  usual  single  leaf  at  the  end 
and  the  two  nearest  side  leaflets.  Very  occasionally  the  double  leaflet  seemed  to 
give  one  side  leaflet  and  the  final  leaflet.  Of  course,  all  these  anomalies  were  rare 
and  had  to  be  sought  for,  but  they  would  form  in  themselves  a  suggestive  study. 

The  Dorsetshire  series  gives  9  pfyr/zo?  instead  of  11  as  the  mode,  and  7  are  more 
frequent  than  13.  This  latter  result  is  also  true  for  the  Monmouthshire  series, 
which  again  has  nothing  like  the  same  preponderance  on  its  modal  11  as  the 
Buckinghamshire  series.  I  take  it  that  the  greater  variability  of  both  Dorsetshire 
and  Monmouthshire  series,  together  with  their  greater  relative  proportion  of  com- 
pound leaves  with  9  and  7  ■pinncB,  is  due  to  those  series  containing  a  much  larger 
proportion  than  the  Buckinghamshire  series  of  small  trees. 

For  ashes  in  general,  I  think,  we  may  safely  take  10  as  the  average  (but  not 
the  modal)  number  of  pinna,  the  standard  deviation  is  1'9  pmncB,  and  the  leaf 
resemblance  is  measured  by  a  correlation  of  "4. 

If  we  compare  racial  and  individual  variabilities  we  have  : 


Series. 

S.  D. 

S.  D.  of  array. 

Coefficient  of 
variation. 

Percentage 
variability. 

Buckinghamshire  . 

1-6891 

1-5663 

15-46 

92-73 

Dorsetshire  .... 

1-9759 

1-8U0 

18-65 

91-81 

Monmouthshire     .  . 

2-0058 

1  -  8342 

18-57 

91-44 

INIean  . 

1-8903 

1-7382 

17-56 

91-99 

Thus  92  per  cent,  of  the  total  racial  variability  of  this  character  of  the  ash  is  to  be 
found  in  the  individual  tree,  and  here,  as  in  all  our  other  series,  it  is  impossible  to 
assert  that  variation  is  a  result  of  sexual  reproduction  ;  it  is  clearly  an  essential 
factor  of  the  individual  growth,  I  conclude  with  the  actual  tables  of  data  for  the 
three  series. 
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(8.)  B.  Spanish  Chestnut  (Csist&nea  vnlgSiris).  Veins  in  the  Leaf. — A  character  which 
at  first  sight  appears  easy  to  determine  is  the  number  of  veins  running  from  the  main 
rib  to  the  edge  of  a  leaf  Tlie  leaves  of  many  trees,  however,  will  on  examination 
be  found  quite  unsuitable,  the  veining  being  far  too  complex,  too  much  indeed  of  a 
network.  Among  fairly  practicable  leaves  are  those  of  the  beech  and  the  Spanish 
chestnut.  Here  two  fundamental  difficulties  occur  :  (i.)  near  the  tip  of  the  leaf  the 
number  of  veins  becomes  rather  difficult  to  register.  In  the  case  of  the  beech  by 
holding  the  leaf  in  various  lights  or  even  using  a  lens,  it  was  possible  to  be  fairly 
sure  of  the  separate  veins.  In  the  case  of  the  chestnut,  the  total  number  of  veins  is 
so  great  that  the  addition  or  omission  of  a  single  vein  is  hardly  likely  on  the  average 
to  lead  to  any  serious  error,  (ii. )  The  occasional  branching  or  bifurcation  of  the  vein 
formed  a  difficulty,  especially  if  it  occurred  near  the  edge  of  the  leaf  If  the  bifurca- 
tion occurred  close  up  to  the  main  rib  of  the  leaf,  the  vein  system  was  counted  as 
two  ;  if  close  to  the  edge  of  the  leaf  as  one ;  intermediate  bifurcations  had  to  be 
left  to  the  judgment  of  the  recorder,  having  regard  to  the  general  appearance  of  the 
leaf  and  its  system  of  subsidiary  veining.  The  doubtful  cases  were  not  very  many, 
and  occurred  more  frequently  with  the  chestnut  than  with  the  beech  leaves.  In 
neither  case  was  the  main  rib  or  vein  included  in  the  total  number  for  the  leaf, 
the  record  was  of  the  side  veins  flowing  from  this.  The  numbers  right  and  left 
in  the  beech  leaf  were  found  to  be  far  more  nearly  equal  than  in  the  case  of  the 
chestnut,  which  thus  jaossessed  considerably  greater  asymmetry.  In  these  cases 
as  in  those  of  the  ash,  the  leaves  were  gathered  at  random  round  the  tree,  and 
at  random  on  the  branches,  care  being,  however,  taken  in  the  case  of  the  chestnut 
to  avoid  leaves  in  the  neighbourhood  of  the  fruit.* 


Series  (i.).  Hampden  Spanish  Chestnuts. 

I  had  hojDed  to  get  records  for  26  leaves  from  each  of  100  trees  from  the  splendid 
chestnut  avenue  at  Hampden  House,  but  it  provided  records  for  some  60  trees  only. 
These  I  owe  to  Miss  Lina  Eckenstein.  I  was  able  to  add  by  diligent  search 
al)out  25  additional  trees  from  the  same  neighbourhood.  All  these  were  large 
forest  trees  of  splendid  growth.  In  order  to  make  the  number  up  to  100,  some  15 
additional  trees  were  added,  these  being  all  large  old  trees  from  other  neighbour- 
hoods. I  look  upon  this  series  as  a  fairly  homogeneous  one,  representing  the  degree 
of  resemblance  between  leaves  on  fully  developed  forest  trees.  The  counting  of  the 
veins  on  this  series  was  undertaken  by  Miss  C.  D.  Fawcett,  B.Sc,  and  the 
tabulating  and  numerical  reduction  by  Dr.  Lee.  Both  these  processes  involved 
an  immense  amount  of  labour,  in  fact  days  and  days  of  careful  and  fatiguing  work. 

*  The  leaves  were  all  gathered  in  the  autumn,  towards  the  fall. 


Table  IV. — Hampden  Chestnut  Trees. 


Number  of  Veins  on  First  Leaf.  ri>  ^ 
 1  I  \ioface;p. 
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0 
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25  50 

0 
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4475 
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3550 

3725 
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2775 

2675 

1475 

1550 

975 
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475 
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75 

0 

75 

65000 

Series  (ii.).    Table  V. — Mixed  Set  Spanish  Chestnut  Leaves. 
Nxmhet-  of  Veim  on  First  Leaf. 
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41. 

42. 

43. 

44. 

45. 

46. 

i  47. 

48. 

49.' 

50. 

51. 

!  52. 

53. 

1  54. 

55. 
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95 

113 

61 

57 

26 

26 

17 

13 

1 

0 

0 

0 

2225 

50     —    —  - 

— 

— 

— 

— 

2 

2 

9 

9 

3 

9 

7 

15 

41 

20 

39 

43 

76 

85 

131 

121 

135 

140 

150 

172 

128 

79 

120 

52 

45 

18 

23 

9 

15 

1 

0 

0 

1 

1700 

— 

0 

1 

0 

1 

1 

5 

8 

6 

ir 

12 

19 

34 

37 

56 

54 

52 

70 

79 

66 

96 

79 

42 

72 

49- 

44 

28 

16 

13 

5 

0 

0 

0 

0 

950 



2 

2 

1 

3 

5 

7 

15 

14 

19 

20 

24 

26 

42 

53 

53 

62 

76 

117 

87 

113 

120 

72 

92 

58 

38 

20 

25 

16 

18 

0 

0 

0 

0 

1200 

/ 

53     —    —  - 

3 

1 

3 

3 

5 

3 

12 

10 

22 

8 

18 

23 

31 

29 

43 

35 

50 

66 

28 

61 

52 

49 

58 

50 

50 

36 

15 

20 

16 

1 

0 

0 

0 

800 



1  j 

1 

1 

2 

6 

1 

2 

6 

6 

12 

2 

8 

21 

21 

22 

49 

24 

45 

45 

37 

57 

45 

44 

38 

60 

24 

26 

8 

12 

9 

1 

0 

0 

0 

625 





0 

0 

1 

0 

2 

3 

6 

2 

3 

8 

13 

9 

23 

22 

23 

33 

13 

26 

18 

28 

20 

36 

26 

10 

7 

12 

5 

1 

0 

0 

1 

350 

0 

0 

1 

i 

g 

4. 

6 

7 

(J 

10 

10 

9 

16 

23 

10 

35 

16 

26 

23 

16 

25 

15 

8 

7 

4 

6 

6 

0 

0 

0 

1 

300 

57     _    _  _ 
54     —  — 

1 

0 

0 

4 

0 

8 

8 

2 

1 

0 

10 

11 

5 

1 

14 

5 

20 

6 

17 

9 

12 

16 

20 

12 

12 

6 

14 

12 

— 

— 

— 

- 

225 

1 

1 

0 

3 

6  1 

6 

1 

2 

0 

12 

10 

12 

11 

14 

7 

22 

13 

13 

16 

5 

18 

16 

9 

6 

6 

12 

6 

— 

— 

— 

— 

225 

55     —    _  _ 
60     _  _ 

1 

2 

0 

1 

0 

8 

2 

2 

3 

1 

1 

1 

0 

0 

1 

1 

1 

0 

25 

1 

0 

0 

0 

0 

0 

0 

0 

ol 

0 

0 

0 

0 

0 

0 

0 

0 

61       -    _  _ 

0 

0 

0 

0 

0 

0 

0 

oi 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

6 

5 

1 

7 

2 

0 

1 

0 

0 

0 

0 

1 

1 

25 

T 

25    25  25 

50  100  150  225  2-5 

0  -575  ^475 

1000  j 

1025  1 

1175  j 

1350  j 

1700 

2200 

2350 

2900 

2775 

3675 

3750 

4400 

4000 

4050 

4225 

4375 

3400 

3425 

2850 

2426 

2225 

1700 

950 

1^00 

800 

625 

350 

300 

225 

1225 

25 

1  ° 

1  ° 

25 

67600 
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Series  (ii.).  Mixed  Spanish  Cliestnuts. 

My  second  series  consists  of  26  leaves  from  each  of  104  trees.  These  were 
gathered  in  a  variety  of  neighbourhoods.  Many  of  these  were  young  trees. 
In  the  case  of  nearly  50  sets,  the  trees  were  pollards  from  old  stumps  with  about 
1 5  years'  growth  ;  some  of  the  others  were  from  fine  old  trees,  and  as  a  result 
we  have  many  districts  and  many  ages  combined.  I  have  to  thank  Miss  C.  D. 
Fawcett  for  sets  from  the  neighbourhood  of  Hampstead,  Miss  Grace  Lee  for  sets 
from  that  of  St.  Albans,  Miss  Charles  for  sets  from  the  neighbourhood  of  Dorking, 
while  about  half  the  series  was  gathered  by  Mr.  Egbert  J.  Parker  from  his 
chestnut  woods  at  Haslemere.  The  counting  of  the  veins  was  undertaken  by 
Miss  M.  NoTCUTT,  while  the  tabulation  and  numerical  reduction  are  the  work 
of  Dr.  W.  R.  Macdonell.  The  series  being  mixed,  the  variability  is  higher  than 
that  of  the  first  series  and  thus  the  labour  spent  on  it  was  even  greater. 

The  following  table  gives  the  general  results  : — • 


Resemblance  of  Spanish  Chestnut  Leaves  from  same  Tree. 


Series. 

Number. 

Mean  number 
of  veins. 

S.  D.  of  veins. 

Correlation. 

Trees. 

Leaves. 

Pairs. 

Hampden  . 
Mixed    .  . 

100 
104 

2600 
2704 

65,000 
67,600 

41-0735  +  -0775 
41-2145  ±  -0840 

5-  8780  +  -0550 

6-  4780  ±  -0594 

-4655[+  -0104] 
•5913[±  -0084] 

Mean  .  . 

41-1440 

6-1780 

-5284 

We  note  at  once  that  the  mean  number  of  veins  for  the  forest  trees  and  for  the 
mixed  group  is  sensibly  the  same.  On  the  other  hand,  the  variability  and  the 
correlation  differ  very  considerably.  The  latter  is  sensiljly  higher  than  the  "4 
obtained  for  the  leaflets  of  the  ash.  I  should  be  inclined  to  consider  that  the 
heterogeneity  of  the  mixed  series  is  at  the  bottom  of  the  difference  between  the  two 
series,  but  there  cannot,  I  think,  be  any  doubt  of  the  greater  resemblance  of  chestnut 
leaves  for  this  character  than  of  ash  leaves  in  the  number  of  pinnce. 

The  actual  distribution  of  veins  in  the  5304  leaves  examined  was  as  follows :  — 
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No.  of  veins. 

19. 

20. 

21. 

22. 

23. 

1 

24. 

25. 

26. 

27. 

28. 

29. 

30. 

31. 

32. 

33. 

1st  series  .... 

1 

2 

0 

0 

5 

4 

6 

14 

14 

19 

42 

43 

49 

60 

2ncl  sei'ies  .... 

1 

1 

1 

2 

4 

6 

9 

10 

23 

19 

40 

41 

47 

54 

68 

lotal  .... 

1 

3 

2 

4 

11 

13 

16 

37 

33 

59 

83 

90 

103 

128 

No.  of  veins. 

34. 

35. 

36. 

37. 

38. 

39. 

40. 

41. 

42. 

43. 

44. 

45. 

46. 

47. 

48. 

1st  series  

89 

102 

120 

125 

149 

165 

198 

155 

179 

153 

142 

.149 

141 

Ill 

107 

2nd  series  .... 

88 

94 

116 

111 

147 

150 

176 

160 

162 

169 

175 

136 

137 

114 

97 

Total  .... 

196 

236 

236 

296 

315 

374 

315 

341 

322 

317 

285 

278 

225 

204 

No.  of  veins. 

49. 

50. 

51. 

52. 

53. 

54. 

55. 

1 

56. 

57. 

58. 

59. 

GO. 

61. 

62. 

1 

Total. 

1st  series  

59 

62 

39 

48 

15 

1 

19 

4 

4 

3 

0 

3 

2600 

2nd  series  .... 

89 

1  68 

38 

48 

32 

25 

14 

12 

9 

9 

1 

0 

0 

1 

2704 

Total  .... 

148 

i  130 

t 

77 

96 

47 

44 

1  18 

i 

16 

12 

9 

4 

0 

0 

1 

5304 

The  total  gives  a  fairly  smooth  series  with  the  mode  at  40,  and  a  secondary  mode 
at  42.  In  the  forest  trees  we  have  the  same  result  as  in  the  total,  but  in  the  mixed 
group  the  secondary  mode  is  at  44,  and  almost  equally  important  with  the  primary. 
The  influence  of  symmetry  makes  itself  felt  in  the  preponderance  of  the  even 
numbers,  although  not  in  such  a  marked  degree  as  in  the  case  of  the  ash,  where  the 
counting  of  the  final  leaflet  gave,  of  course,  preponderance  to  the  odd  numbers. 

For  the  Spanish  chestnut  we  may  accordingly  take  41  veins  as  the  average, 
40  as  the  modal  number  to  the  leaf;  6*2  as  the  S.D.  of  the  veins;  and  "5  (slightly 
weighting  the  homogeneous  series)  as  the  degree  of  resemblance  between  the  leaves  ol 
the  same  tree. 

It  remains  to  consider  the  comparative  individual  and  racial  variabilities : — 


Series. 

S.  D. 

S.  D.  of  array. 

Coefficient  of 
variation. 

Percentage 
variability. 

Hampden  .... 
Mixed  .    .    .    .  . 

5-  8780 

6-  4780 

5-2024 
5-2242 

14-  31 

15-  72 

88-51 
80-65 

Mean  .... 

6-1780 

5-2133 

15-01 

84-58 

We  see  that  the  variability  of  the  chestnut  leaf  as  measured  by  the  coefficient  of 
variation  is  somewhat  less  than  that  of  the  ash.    The  individual  as  compared  with 
the  racial  variability  is  also  somewhat  less  than  in  the  case  of  the  ash,  but  the 
individual  exhibits  still  some  8  5  per  cent,  of  the  racial  variability.    Tables  IV.  and  V, 
give  the  data  for  the  chestnut  series. 
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(9.)  C.  Beech  (Fagus  sylvatica).  Number  of  Veins  on  the  Leaf.— I  have  here  only 
one  series  of  26  leaves  from  each  of  100  trees.  These  were  gathered  and  counted  by 
Mrs.  Karl  Pearson  and  myself  in  the  neighbourhood  of  Great  Hampden.  The 
trees  were  in  part  fine  isolated  specimens  growing  on  the  commons,  but  as  there  were 
not  sufficient  of  these  we  were  compelled  to  gather  from  the  somewhat  smaller  trees 
abounding  in  the  beechwoods  of  this  part  of  Buckinghamshire.  The  leaves  were 
gathered  about  4  to  5  feet  from  the  ground  on  the  outside  of  the  tree.*  The 
tabulation  of  the  results  and  the  calculation  of  the  constants  are  again  the  work  of 
Dr.  Lee. 


Hampden  Beech  Leaves. 


Number. 

Mean  number 
of  veins. 

S.D.  of 
veins. 

Correlation 
of  pairs. 

Trees. 

Leaves. 

Pairs. 

100 

2600 

65,000 

16-1062 
± -0230 

1-7351 
±  -0162 

•5699 
[±  -0087] 

Distribution  of  Number  of  Veins. 


Number  of  veins ... 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

Total. 

Frequency   

1 

7 

34 

110 

318 

479 

595 

516 

307 

181 

36 

15 

1 

2600 

Individual  and  Racial  Variation. 


S.D. 

S.D.  of  array. 

Coefficient  of  variation. 

Percentage  variability. 

1-7351 

1-4258 

10-77 

82-17 

Accordingly,  we  see  that  the  modal  value  of  the  beech  leaf  veins  is  16,  which  is 
very  near  the  mean,  16"11.  The  standard  deviation  about  1"7,  and  the  degree  of. 
resemblance  among  its  leaves  "57  Measured  by  its  coefficient  of  variation,  it  is  the 
least  variable  of  the  trees  so  far  dealt  with,  and  the  leaves  of  the  same  tree  more 
closely  resemble  each  other.  Notwithstanding  its  relatively  small  variability,  the 
individual  on  an  average  exhibits  about  82  per  cent,  of  the  racial  variability. 

The  table  of  reduced  data  is  given  below. 

*  I  did  not  observe  among  the  leaves  gathered  anything  of  the  differentiation  indicated  hj  Professor 
Macleod  between  "  licht-  en  schadernbladen."  The  material,  so  far  as  this  character  (number  of  veins) 
goes,  seemed  singularly  homogeneous  and  of  small  variability. 
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Table  VI. — Hampden  Beech  Trees. 

Number  of  Veins  on  First  Leaf. 


1  r\ 
lU. 

1 1. 

1  0 

1  O. 

1  A 

14. 

10. 

1  R 
1  0. 

i  1 . 

io. 

iy. 

22. 

iotaJs. 

A 

U 

0 

fr 
/ 

/? 
0 

Q 

o 

25 

11 

2 

8 

35 

45 

59 

23 

3 

175 

12 

7 

35 

92 

145 

261 

156 

92 

47 

15 

850 

13 

7 

45 

145 

394 

836 

689 

387 

199 

46 

1 

1 

— 

2750 

U 

6 

59 

261 

836 

2124 

2021 

1616 

723 

243 

54 

4 

o 

7950 

15 

3 

23 

156 

689 

2021 

3256 

3122 

1856 

605 

221 

20 

3 

11975 

16 

3 

92 

387 

1616 

3122 

4576 

3205 

1380 

437 

51 

6 

14875 

17 

47 

199 

723 

1856 

3205 

3914 

1928 

860 

120 

46 

2 

12900 

18 

15 

46 

243 

605 

1380 

1928 

1882 

1278 

213 

80 

7675 

19 

1 

54 

221 

437 

860 

1278 

1248 

282 

137 

? 

4525 

20 

1 

4 

20 

51 

120 

213 

282 

'  138 

64 

7 

900 

21 

3 

3 

6 

46 

80 

137 

64 

32 

4 

375 

22 

2 

5 

7 

7 

4 

0 

25 

Totals 

25 

175 

850 

2750 

7950 

11975 

14875 

12900 

7675 

4525 

900 

375 

25 

65000 

s 


(10.)  D,  Holly  (Ilex  Aquifolium).  Number  of  Pricldes  on  the  Leaf. — This  character 
seemed  an  easy  one  so  far  as  mere  counting  goes,  although  the  leaves  were  troublesome 
to  gather.  The  leaves  were  gathered,  so  far  as  possible,  all  round  the  tree  or  bush, 
about  the  height  at  which  cattle  would  browse,  and  from  the  outside  of  the  bush. 
Internal  and  high  branches  (often  differentiated  by  being  almost  prickleless)  were  to 
be  avoided.  Flowering  trees,  and,  above  all,  leaves  near  the  flowering  buds  or  fruit 
were  to  be  rejected.  These  conditions  could  only  be  approximately  followed  in 
practice  when  a  large  number  of  trees  had  to  be  dealt  with  from  one  neighbourhood, 
and  these  series,  I  fancy,  allow  more  to  the  personal  equation  of  the  gatherer  than  is 
wholly  satisfactory. 

Series  (i.).  Somersetshire  Hollies. 

I  owe  this  series  of  26  leaves  from  each  of  100  trees  to  Miss  Agnes  Fey.  Of  these 
trees,  56  were  hedge-row  hollies,  and  44  wood  trees  or  bushes  growing  in  moist 
places,*    The  counting  of  the  prickles  was  in  some  part  undertaken  by  Mr,  Macleod, 

*  Miss  Fky  writes  to  me:  "The  texture  of  the  leaves  seems  rather  different,— the  wood  hollies  are 
much  less  stout  and  harsh  and  more  often  have  few  prickles.  In  fact,  I  rather  wonder  if  shade  does  not 
affect  the  number.  In  the  few  cases  where  hedge-row  hollies  have  very  few  prickles  the  leaves  come  from 
the  inside  of  the  bush.  I  have  carefully  avoided  the  neighboiu-hood  of  berries,— and  what  is  more 
difficult  of  flower  buds.  But  the  holly  has  two  flowers,  one  with  abortive  pistil  and  one  with  abortive 
stamens.  The  latter  I  suppose  would  drop  away  and  leave  no  trace  of  their  presence.  But,  on  the 
whole,  I  have  avoided  flower  and  berry-bearing  bushes — it  is  easy  to  find  bushes  without  either.  And,  so 
far  as  my  observation  goes,  I  am  not  convinced  that  the  neighboiu-hood  of  flowers  or  the  top  of  the  tree 
causes  modification  ....  It  is  very  difficult  to  keep  to  a  standard  of  counting  for  the  tip,  but  I  have 
tried  to  do  so.  Doubtless  the  hollies  too  are  of  rather  different  ages,  but  I  have  avoided  patriarchal  trees, 
though  doubtful  if  this  makes  much  diflerence  in  this  case."  It  will  be  seen  that  in  this  point,  as  in  others, 
much  had  to  be  left  to  the  judgment  of  a  careful  observer. 
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but  for  more  than  three-quarters  of  the  trees  by  Miss  Fry  herself  The  labour  of 
tabulation  and  the  deduction  of  the  constants  was  undertaken  by  Dr.  Lee. 

Series  (ii.).  Dorsetshire  Hollies. 

This  series  of  26  leaves  from  each  of  100  trees  was  gathered  and  counted  by 
Miss  C.  D.  Fawcett,  B.Sc.  It  differs  in  several  important  respects  from  Miss  Fry's 
series.  As  it  came  from  the  neighbourhood  of  the  sea,  by  Lyme  Regis,  we  might 
expect  a  considerable  difference  of  environment  ;  the  trees  were  chiefly  hedgerow 
trees.  They  show,  however,  a  lower  mean  number  of  prickles  than  the  Somersetshire 
series,  which  embraced  a  large  element  of  wood  trees.  I  think  the  series  is  less 
homogeneous  than  the  Somersetshire  series.  In  particular.  Tree  91  contributes  the 
almost  abnormal  series  of  24  leaves  with  1  and  2  leaves  with  2  prickles.  The  lump 
at  the  end  of  the  frequency  distribution  is  practically  due  to  this  one  tree.  I  do  not 
feel  able  to  reject  it  entirely,  but  I  give  the  numerical  constants  as  determined  with 
and  without  it.  I  take  it  that  of  the  two  observers  the  personal  equation  of  one 
tended  rather  to  emphasise  the  need  for  a  mathematically  "random  selection"  of 
trees,  and  the  personal  equation  of  the  other  to  emphasise  the  importance  of  a 
botanically  "homogeneous  group"  of  trees.  The  result  has  been  the  increase  of 
correlation  and  variability  in  the  first,  and  their  decrease  in  the  second  case. 
Probably  their  mean  is  not  far  from  the  truth.  It  is  noteworthy  that  the  ash  trees 
from  Dorsetshire  and  Monmouthshire,  although  exhibiting  a  like  substantial  difference 
of  environment,  were  far  more  alike. 

The  tabulation  and  reduction  of  this  series  is  also  due  to  Dr.  Lee.  I  now 
summarise  the  results  as  before  : — 


Resemblance  of  Holly  Leaves  from  the  same  Tree. 


Series. 

Number. 

Mean  number  of 
prickles. 

S.  D.  of  prickles. 

Correlation. 

Trees. 

Leaves. 

Pairs. 

Somersetshire  (i.) 
Dorsetshire  (ii.)«. 

100 
100 
99 

2600 
2600 
2574 

65,000 
65,000 
64,350 

15-2935  ±  -0379 
13-4373  +  -0497 
13-5622  ±  -0474 

2-  8655  +  -0268 

3-  7595  ±  -0352 
3-5661  ±  -0335 

•3648 
-6423 
•  5985 

±  -0115" 
'±  -0078' 
■±  -0085' 

Mean  (i.)  and  (ii.)« 
Mean  (i.)  and  {n.)h 

14-3654 
14-4278 

3-3125 
3-2158 

•5035 
•4816 

(ii.)6  is  the  Dorsetshire  series,  excluding  tree  (91). 


Ill  round  numbers  accordingly  the  number  of  prickles  on  the  holly  leaf  is  14  "4,  the 
standard  deviation  3 "3  prickles,  and  the  degree  of  resemblance  "5.  The  holly  thus 
falls  between  our  results  for  the  ash  and  the  chestnut,  one  series  approaching  our 
Buckinghamshire  ashes,  the  other  the  group  of  mixed  Spanish  chestnuts,  being 
somewhat  in  excess  of  the  Buckinghamshire  beeches. 
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The  actual  frequency  of  prickles  is  given  by  : 


Series. 

0. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

9.  ; 

i 

10. 

11. 

12. 

13. 

Somersetshire  (i.)  .    .  . 

1 

2 

2 

3 

6 

6 

10 

8 

10 

24 

43 

74 

135 

276 

Dorsetshire  (ii.)^(.    .    .  . 

0 

40 

21 

14 

18 

27 

31 

45 

62 

84 

86 

152 

219 

336 

(ii-y-*  .   .  . 

16 

19 

14 

18 

27 

31 

45 

62 

84 

86 

152 

219 

336 

Total  ^1    anrl  (u  \ii 

1 

42 

23 

17 

24 

33 

41 

53 

72 

108 

129 

226 

354 

612 

Total  (i.)  and  (ii.)/j  . 

1 

18 

21 

17 

24 

33 

41 

53 

72 

108 

! 

129 

226 

354 

612 

Series. 

u. 

15. 

16. 

17. 

18. 

19. 

20. 

21. 

22. 

23. 

24. 

25. 

26. 

Total. 

Somersetshire  (i.)  .    .  . 

335 

417 

407 

328 

235 

122 

79 

44 

22 

6 

3 

1 

1 

2600 

Dorsetshire  (ii.)«    .    .  . 

350 

337 

311  ! 

217 

128 

70 

31 

12 

8 

1 

0 

0 

0 

2600 

{n.y>  .  .  . 

350 

337 

311 

217 

128 

70 

3. 

1 

2 

8 

1 

0 

0 

0 

2574 

Total  (i.)  and  {n.)a 

685 

754 

718 

545 

363 

192 

110 

56 

30 

7 

3 

1 

1 

5200 

Total  (i.)  and  {ii.)b  . 

685 

754 

718 

545 

363 

192 

110 

56 

30 

7 

3 

1 

1 

5174 

The  series  is  thus  seen  to  be  fairly  uniform,  the  mode  corresponding  in  the  total 
result  as  well  as  the  Somersetshire  series  to  1 5  prickles,  while  that  of  the  Dorsetshire 
series  is  at  14,  an  even  number  of  prickles.  The  holly  leaf  has  clearly  not  the 
marked  symmetry  of  ash  or  chestnut  leaves.  Thus,  16  prickles  in  the  Somersetshire 
series  are  almost  as  frequent  as  15,  or  since  there  is  almost  always  a  prickle  at  the  tip, 
asymmetrical  leaves  must  be  very  nearly  as  frequent  as  symmetrical. 

Lastly,  turning  to  the  comparison  of  racial  and  individual  variabilities,  we  have  the 
table  : — 


Series. 

S.  D. 

S.  D.  of  array. 

Coefficient  of 
variation. 

Percentage 
variability. 

Somersetshire  (i.)  .  . 
Dorsetshire  (ii.)'f  .  . 
„        {ii.)h  .  . 

2-  8655 

3-  7595 
3-5661 

2-6680 
2-8815 
2-8282 

18-74 
27-98 
26-29 

93-12 
76-65 
80-11 

.    Mean  (i.)  and  (ii.)'f 
Mean  (i.)  and  {u.)h. 

3-3125 
3-2158 

2-7747 
2-7482 

23  36 
22-52 

84-88 
86-61 

Here  again  we  see  how  great  is  the  variability  within  the  individual.  Looking 
upon  Tree  (91)  as  so  largely  biasing  the  complete  Dorsetshire  series  that  we  must 
exclude  it,  we  see  that  the  individual  possesses  80  to  90  per  cent,  of  the  variability  of 
the  race.  The  variability  of  the  offspring  of  the  individual  will,  of  course,  be  larger 
still,  even  if  we  had  merely  uniparental  reproduction.  Thus  we  have  still  further 
evidence,  if  more  were  necessary,  that  variability  is  not  a  factor  dependent  upon 
sexual  reproduction.  For  this  character  of  the  prickles  it  will  be  seen  that  the  holly 
leaf  is  more  variable  than  those  of  the  ash,  chestnut,  or  beech.  The  following  are  the 
fundamental  tables  of  data  : — 
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Section  11.— Seed  Vessels  of  Flowers.    A.  Poppies. 

(11.)  Having  dealt  with  several  characters  in  a  variety  of  trees,  I  thought  it 
desirable  to  deal  with  a  single  character  in  a  variety  of  races  in  a  very  different 
branch  of  the  vegetable  kingdom.  After  consultation  with  my  colleague,  Professor 
F.  O.  Oliver,  the  stigmatic  bands  on  the  seed  capsules  of  poppies  were  selected,  and 
they  have  proved  an  easily  recorded  character,  which  could  be  considered  for  a  variety 
of  races.  Here  again  there  was  some  diversity  in  the  method  of  collection.  The 
rules  given  being  (i.)  that  the  bands  were  to  be  counted  at  the  periphery  of  the 

capsule  and  not  at  the  centre;  thus  "^^'^^  counted  as  7,  not  6,  bands;  and 

(ii.)  every  possible  bud  on  the  plant  was  to  be  counted.  The  work  went  on  from 
spring  to  autumn  of  the  year  1899  ;  it  was  impossible  to  collect,  dry,  and  preserve  for 
counting  all  the  wild  poppies  gathered  as  soon  as  they  had  ceased  flowering.  Thus 
some  poppies  were  taken  when  they  had  completely  done  flowering,  others  with  flowers 
on  them,  and  others  with  many  buds.  Few,  I  think,  would  be  included  which  had 
not  formed  all  their  buds.  In  my  own  series,  I  counted  the  bands  on  every  capsule, 
in  every  flower,  and  in  nearly  every  bud,  opening  the  bud  and  using  a  microscope 
when  necessary.  The  labour  of  this  was  very  great,  and  was  not  carried  out  by  all 
my  helpers.  I  doubt,  indeed,  whether  any  advantage  was  derived  from  it,  except  in 
so  far  as  there  may  be  correlation  between  the  number  of  stigmatic  bands  and  the 
early  or  late  development  of  the  seed-vessel. 
(12.)  The  following  series  were  obtained  : — 

Series  (i.),  Buckinghamshire  Wild  Poppy  (Papaver  Pthoeas). — I  gathered 
200  specimens  from  as  wide  a  range  as  possible — roadside,  field-hedgerow,  and 
cornfield — avoiding  Papaver  duhium,  and  counting  every  set  of  stigmatic  bands  right 
down  to  the  smallest  bud  I  could  manage.  These  specimens  were  gathered  on  the 
very  top  of  the  Chilterns  ;  the  soil  and  climate  is  not  a  very  favourable  one,  and 
my  mean  number  of  buds  was  a  small  one.  I  myself  tabled  the  data  and  calculated 
the  constants. 

Series  (ii.).  Buckinghamshire  Wild  Poppy  (Papaver  Rhceas). — Miss  C.  D.  Fawcett 
gathered  236  poppy  plants  on  the  southern  slope  of  the  Chilterns,  towards  their  foot. 
They  came  nearly  all  from  a  single  cornfield,  and  thus  their  environment  was  much 
more  uniform,  and  the  climate  more  favourable  than  in  Series  (i.).  It  is  quite 
possible  that  these  j^oppies  were  largely  the  oflspring  of  a  comparatively  few  poppies, 
as  they  were  gathered  from  a  small  area.  Dr.  Lee  tabled  the  data  and  calculated  the 
constants. 

Series  (iii.).  Somersetshire  Wild  Poppy  (Papaver  Hhceas). — The  bulk  of  this  series 
were  gathered  by  Miss  Agnes  Fry  in  the  Quantocks,  but  being  rather  a  short  series, 
86  plants,  I  added  to  it  several  other  short  series,  42  plants  in  all,  gathered  and 
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counted  by  various  less  active  helpers.  Thus  the  series  is  really  an  omnibus  series, 
although  mainly  from  the  Quantocks.  Dr.  Lee  tabulated  the  data  and  calculated  the 
constants. 

My  other  two  series  are  of  the  Shirley  Popjjy,  a  well-known  and  beautiful  garden 
variety. 

Series  (iv.).  Chelsea  Shirley  Popjjy. — -These  plants  were  grown  by  my  colleague, 
Professor  F.  O.  Oliver,  in  his  garden  at  Chelsea.  The  plants  had  scarcely  been 
thinned,  and  so,  being  very  crowded,  grew  tall,  and  with  few  buds  on  the  plant.  The 
plants  were  taken  up  and  forwarded  to  me  by  rail,  but  being  delayed  in  transit 
suffered  a  great  deal,  so  that  a  good  many  cajDSules  w^ere  broken  off,  or  could  not  be 
counted.  These  circumstances  account  for  the  smallness  of  the  number  of  pairs 
obtained  from  each  plant.  Professor  Oliver  kindly  came  and  assisted  me  in  the 
counting.    The  data  were  tabulated  and  reduced  by  myself 

Series  (v.).  Buckinghcwishire  Shirley  Poppy. — These  plants  were  grown  at 
Hampden  Farm  House,  and  covered  a  large  border.  They  were  extremely  healthy, 
large  plants,  and  I  have  counted  as  many  as  120  capsules  on  a  single  plant  !  The 
plants  were  taken  ujd  singly  as  they  had  done  flowering  and  the  bands  counted.  The 
labour  of  counting  was  severe,  as  there  were  4443  capsules  on  176  plants  as  compared 
with  1020  capsules  on  325  Chelsea  poppies.  The  tabulation  involved  the  entry  of 
197,478  pairs,  perhaps  the  largest  number  we  have  yet  dealt  with  in  a  single  corre- 
lation table  ;  this  and  the  determination  of  the  constants  is  the  work  of  Dr.  Lee. 

The  reader  will  notice  that  the  investigations  on  poppies  differ  largely  from  those 
on  the  leaves  of  trees.  In  the  latter  case,  26  leaves  were  taken  at  random  from  the 
trees  and  the  character  on  each  measured.  In  the  former  case,  every  available 
capsule  was  dealt  with.  These,  of  course,  are  not  all  the  capsules  put  forth  by  the 
plant,  and  the  proportion  of  these  two  sets  would  vary  considerably  from  one  series  to 
another,  ajDproaching  fairly  close  to  equality  in  my  wild  poj^pies  from  the  toj)  of  the 
Chilterns,  and  differing  very  considerably  in  the  Shirley  poppies  from  Chelsea.  In  the 
leaves  of  the  trees  no  tree  was  weighted  by  its  size  or  vigour  ;  in  the  case  of  a 
vigorous  poppy,  however,  it  may  contribute  ten  times  as  many  pairs  to  our  table  as  a 
feeble  plant.  There  is  a  good  deal  to  be  said  for  both  methods  of  dealing  with  the 
problem,  and  even  for  a  third  method  to  which  I  shall  refer  immediately.  If  we  pick 
up  two  leaves  at  random  and  wish  to  consider  the  probability  that  they  belong  to  one 
tree,  we  ought  certainly  to  allow  for  the  larger  number  of  pairs  of  leaves  on  the 
larger  trees ;  if  we  wish  to  discover  to  what  extent  two  randomly  chosen  men  have 
characters  like  to  those  of  brothers,  we  ought  certainly  to  allow  for  the  large 
families,  even  if  we  do  not  deal  with  every  possible  pair  of  brothers  in  those  families. 
These  two  different  methods  of  dealing  with  resemblance  due  either  to  the  production 
of  like  organs  by  the  individual  or  to  heredity  must  be  carefully  borne  in  mind.  In 
dealing  with  Mr.  Galton's  data  for  stature*  I  have  taken  every  possible  pair  of 

*  '  Phil.  Trans.,'  A,  vol.  187,  pp.  253-318. 
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brothers  ;  in  my  own  measurements  of  1000  families  I  have  taken  only  two  brothers 
from  each  family.  We  cannot  a  priori  expect  the  results  to  come  out  exactly  the  same. 
Indeed,  we  might  a  priori  exj^ect  the  latter  method  to  give  rather  higher  results 
than  the  former,  for  it  introduces  more  individual  families,  and  if  we  tabulated  all 
the  brothers  of  one  family  only  or  all  the  like  organs  of  one  organism,  we  should  find 
nothing  but  a  spurious  correlation  growing  indefinitely  small  as  the  number  of 
brothers  or  organs  was  indefinitely  increased.'*  Thus  I  anticipated  that  the 
Hampden  Shirleys  would  give  a  lower  correlation  than  the  Chelsea  Shirleys,  the 
wild  poppies  from  the  top  of  the  Chilterns  a  higher  value  than  those  from  the 
bottom,  &c. 

As  our  series  of  popjDy  j)lants  were  in  no  case  very  large,  it  was  impossible  to  directly 
test  this  result  by  taking  only  a  single  pair  from  each  plant,  even  if  it  had  been 
possible  after  the  counting  ajid  marking  to  select  a  pair  of  capsules  at  random.  I 
accordingly  adopted  a  third  or  intermediate  method.  Miss  Fawcett  had  recorded 
her  capsules  under  the  letters  a,  h,  c,  &c.,  and  not  grouped  them  directly  mider 
the  nimiber  of  the  stiginatic  bands.  Accordingly,  it  was  possible  to  take  a  with  h, 
c  with  d,  e  with  f,  &c.,  and  thus  repeat  no  single  capsules,  but  get  a  number  of 
pairs  sensibly  taken  at  random.  Thus  large  and  vigorous  plants  will  give  more 
pairs  than  feeble  ones,  and  yet  much  fewer  than  if  we  take  every  possible  pair. 
This  series  will  be  referred  to  as  Series  (ii.)^".  Its  general  result  confirms  the  anti- 
cipation referred  to  above.  It  will  be  noticed  that  nothing  of  this  kind  can  be 
done  in  the  case  of  leaves  of  trees,  where  we  cannot  gather  every  leaf.  One  great 
advantage  of  the  26  leaves  from  every  tree  is  the  means  it  provides  of  testing  the 
accuracy  of  the  lengthy  process  of  tabulation  and  calculation.  Each  leaf  is  used  25 
times,  and  thus  the  mean  and  standard-deviation  of  the  total  number  of  pairs  ought 
to  be  the  same  as  those  of  the  total  series  of  leaves  (say,  2600  for  100  trees).  This 
valuable  method  of  verification  is  lost  when  we  do  not  take  the  same  number  of 
like  organs  from  each  individual.  It  then  becomes  of  considerable  interest  to  ascer- 
tain how  far  the  mean  and  variability  of  the  total  number  of  organs  observed  are 
identical  with  the  mean  and  variability  of  the  total  number  of  pairs  of  organs 
observed.  This  comparison  is  made  in  Series  (v. the  mean  and  variability  of  the 
4443  capsules  being  compared  with  those  of  the  corresponding  197,478  pairs.  The 
whole  of  the  tabulation  and  arithmetic  involved  in  Series  (ii.)'"'*  is  mine,  but  Series  (v.)'-"* 
is  again  due  to  Dr.  Lee. 

I  now  proceed  to  sum  up  the  results  obtained  from  the  above  seven  series. 


I  worked  out  the  theory  of  this  some  time  ago,  and  may  pulilish  it  on  another  occasion. 
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Resemblance  of  Poppy  CajDSules  from  the  same  Plant. 


Series. 

Number  of 

Mean  niimber  of 
bands. 

S.  D.  of  bands. 

Correlation. 

Plants. 

Capsules. 

Pairs.* 

Wild  poppy  (i.)  . 
(top  of  Chilteriis) 
Wild  poppy  (ii.)  . 
(southern  slope  of 

Chilterns) 
Wild  popp3^  (ii.)'''"' 
(no  capsule  re- 
■nea.terl  ^ 
Wild  poppy  (iii.)  . 
(Quantocks) 

200 
2.36 

236 

127 

1305 
2343 

2268 

1324 

11,026 
25,160 

2,268 

19,790 

10-0443  ±  -0331 
9-9055  ±  -0210 

9-8444  ±  -0211 

8-7733  ±  -0270 

1-7704  ±  -0234 
1-5087  ±  -0149 

1-4878  ±  -0149 

1-4553  ±  -0191 

-5620  [±  -0128] 
-3997  [±  -0117] 

•4452  [±  -0114] 

-5333  [±  -0133] 

Shirley  poppy  (iv.) 

(Chelsea) 
Shirley  poppy  (v.) 
(Hampden) 

325 
176 

1020 
4443 

2,756 
197,478 

12-3679  ±  -0.355 
12-6103  ±  -0191 

1-6803  ±  -0251 
1-8853  ±  -0135 

•6149  [±  -0131] 
-5238  [±  -0073] 

Mean  (i.),  (ii.),  (iii.) 
Mean  (iv.),  (v.) 

563 
501 

4972 
5463 

55,976 
200,234 

9-5744 
12-4891 

1-5781 
1-7828 

•4983 
-5693 

Mean  (i.)  to  (v.)  . 

1064 

10,435 

256,210 

[10-6524] 

[1-6600] 

•5267 

Now  this  table  contains  a  good  deal  of  interest.  Upwards  of  a  thousand  poppy 
plants  were  dealt  with,  involving  the  counting  of  the  bands  on  more  than  ten  thousand 
capsules,  and  the  entry  of  more  than  a  quarter  of  a  million  of  pairs  in  the  correlation 
tables.  We  see  at  once  that  the  mean  and  variability  of  the  Shirley  poppy  differ  so 
widely  from  those  of  the  wild  poppy,  that  it  is  idle  to  group  these  species  for  these 
characters  together.  It  is  interesting  to  note,  however,  that  both  in  the  number  of 
bands  and  in  the  variability  the  Hampden  poppies,  whether  wild  or  Shirley,  stand  at 
the  top  of  their  respective  series.  With  regard  to  the  wild  poppy,  I  probably 
gathered  from  a  wider  area  with  a  greater  variety  of  environment  than  was  the  case  in 
Series  (ii.)  or  (iii.).  That  mine  were  gathered  from  much  poorer  soil  is  evidenced  by 
my  plants  having  on  an  average  only  6  cajDsules,  while  the  Quantocks  and  southern 
Chiltern  poppies  had  about  10  capsules.  The  difference  is  possibly  even  greater  than 
these  numbers  indicate,  for  I  counted  every  bud  I  possibly  could.  The  two  Chiltern 
series  do  not  differ  much  in  the  mean  number  of  bands,  but  the  difference  of  variability 
is  very  significant.  I  should  take  it  to  be  a  very  fair  measure  of  the  relative  range 
of  environment  in  the  two  cases.  When  we  turn  to  the  Shirley  series  the 
difference  is  almost  too  great  for  any  comjjarison.  While  the  Hampden  series  had 
some  twenty-five  capsules,  on  an  average,  to  the  plant,  the  Chelsea  series  numbered 

*  The  reader  must  bear  in  mind  that  this  column  gives  the  number  of  pairs  of  the  symmetrical  tables 
below,  or  twire  the  number  of  independent  pairs. 
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only  about  three  !  In  fact,  besides  being  extremely  crowded  a  process  of  selection 
had  gone  on  in  the  transit.  From  examination  of  the  uninjured  plants,  I  think  that 
five  capsules  a  plant  would  be  more  than  the  average  of  these  poppies  before  packing, 
and  that  at  least  one  to  two  capsules  per  plant  were  destroyed  on  the  jom-ney. 
Accordingly,  I  do  not  consider  that  much  weight  should  be  given  to  the  results  of 
this  series,  but  I  felt  bound  to  include  it  as  I  have  included  all  the  series  dealt 
with,  even  when  I  hold  them  to  be  really  unsuited  to  the  present  investigation. 
If  we  consider  the  two  methods  of  dealing  with  Series  (ii.),  i.e.,  that  of  including 
all  possible  pairs,  and  that  of  including  pairs  in  which  no  capsule  was  repeated,  we 
find  that  no  very  sensible  change  is  made  in  the  mean  or  variation,  but  a  rather 
greater  increase  in  the  correlation  than  might  have  been  anticipated.  I  think  this  is 
largely  due  to  unconscious  selection  in  the  choice  of  pairs,  i.e.,  we  took  the  records 
for  (a)  and  (h),  (c)  and  (d),  &c.,  as  our  pairs  ;  it  is  quite  possible  that  in  recording  the 
capsules  successive  letters  would  correspond  to  capsules  gathered  from  the  same  part 
of  the  plant,  and  so  the  members  of  the  pair  would  be  subject  to  the  same  or 
somewhat  similar  conditions  of  nutriment,  light  and  warmth  during  development. 
Valuable  investigations  have  yet  to  be  made  on  the  relative  degree  of  resemblance  of 
leaves  or  seed-vessels  coming  at  random  from  the  same  tree  or  plant,  and  coming  from 
the  same  branch  or  shoot.  The  effects  of  position  on  the  individual  undoubtedly 
produces  some  differentiation,  and  all  we  can  hope  to  do  at  present  is  to  get  round 
numbers  approximately  describing  the  degree  of  resemblance  of  these  theoretically 
undifferentiated  like  organs. 

If  we  exclude  the  Chelsea  series,  we  have  for  the 


Wild  poppy   -4983 

Shirley  poppy    .   -5238 

Mean  .    .    .    .  -5110 

For  the  four  series  of  trees  we  have 

Ash   -3918 

Chestnut   -5284 

Holly   .   -5035 

Beech  .    .,   -5699 

Mean  ....  '4984 


So  far  as  the  seed-vessels  of  poppies  and  the  leaves  of  trees  are  concerned  we  have 
a  degree  of  resemblance  in  undifferentiated  like  organs  which  approximates  to  the 
value  "5. 

VOL.  cxcvii. — A.  2  s 
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It  will  be  as  well  to  put  together  the  frequency  distributions  of  the  capsules  as  we 
have  done  in  the  case  of  the  trees : — 


Wild  Poppies  {Papaver  Rhceas).    Frequency  of  Stigmatic  Bands. 


Series. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

Totals. 

Chilterns  (i.)  . 

2 

17 

86 

182 

255 

272 

260 

130 

67 

25 

9 

1305 

Chilterns  (ii.)  . 

1 

13 

103 

304 

571 

634 

428 

198 

56 

26 

6 

3 

2343 

Quantocks  (iii.)  . 

17 

61 

189 

358 

316 

227 

96 

39 

20 

1 

1324 

Total..    .  . 

20 

91 

378 

844 

1142 

1133 

784 

367 

143 

52 

15 

4972 

Shirley  Poppies.    Frequency  of  Stigmatic  Bands. 


Series. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

17. 

18. 

19. 

Totals. 

Chelsea  (i.).  . 
Hampden  (ii.) 

0 
1 

1 
11 

1 

32 

4 
56 

40 
148 

95 
363 

167 
628 

245 
925 

202 
954 

145 
709 

86 
397 

26 
155 

7 
51 

1 

12 

0 
1 

1020 
4443 

Total  . 

1 

12 

33 

60 

188 

458 

795 

1170 

1156 

854 

483  181 

58 

13 

1 

5463 

These  distributions  appear  to  be  uniform  although  asymmetrical,  and  they  give 
no  obvious  sign  of  a  mixture.  The  mode  is  10  for  the  Chilterns  series  and  8  for  the 
Quantocks  series,  while  the  Shirley  poppies  give  12  for  Chelsea  and  13  for  Hampden. 

It  remains  to  tabulate  the  several  results  obtained  for  variation.  These  are 
given  in  the  following  table.  We  see  at  once  that  the  general  effect  of  weighting 
the  capsules  with  the  number  of  pairs  on  the  plant  has  the  effect  of  increasing  the 
mean  and  reducing  the  variability  ;  in  other  words,  the  plants  with  most  capsules  on 
and  leading  to  most  pairs,  are  those  with  the  greatest  number  of  stigmatic  bands. 
We  have  some  evidence  accordingly  that  the  most  vigorous  plants  exhibit  most 
bands  on  the  capsules  : — 
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Individual  and  Racial  Variability. 


Series. 

Stigmatic  bands. 

Coefficient  of 
variation. 

S.  D. 

of  array. 

Percentage 
variability. 

Mean. 

S.  D. 

JFild  Poppy. 

(i.)  from  pairs  

„     „  capsules  

(ii.)  from  pairs  

,,    pairs  without  repetition  of 

capsules  

(iii.)  from  pairs  

,,       „  capsules  

10-0443 
9-9318 
9-9055 
9-8288 

9-8444 
8-7733 
8-7236 

1-7704 
1-7684 
1-5087 
1-5007 

1-4878 
1-4553 
1-5402 

17-6259 
17-8052 
15-2309 
15-2677 

15-  1132 

16-  5878 

17-  6556 

1 -4644 
1-3829 

1-3322 
1-2311 

82-71 
91-66 

89-54 
84-59 

Mean  from  pairs  .... 
Mean  from  capsules  .  . 

9-5744 

1-5781 

1  DUOi 

16-4809 
16-9095 

1-3661 

86-32 

Shirley  Foppy. 

,,      „  capsules  

(v.)  from  pairs  

„       ,,  capsules  

12-3679 
12-3873 
12-6103 
12-5091 

1-6803 
1-7326 
1-8853 
1-8977 

13-5860 

13-  9869 

14-  9505 

15-  1705 

1-3251 
1-6060 

78-86 
85-18 

Mean  from  pairs  .... 
Mean  from  capsules.    .  . 

12-4891 
12-4482 

1-7828 
1-8151 

14-2682 
14-5787 

1-4655 

82-02 

Total  mean  from  pairs  . 
Total  mean  from  capsules  . 

15-5962 
15-9772 

1-4019 

84-60 

But  we  might  well  have  expected  a  good  deal  more  evidence  of  this.  The 
difiPerences  of  the  means  calculated  from  capsules  and  pairs  is  not  very  great,  and 
in  two  out  of  three  series  of  the  wild  poppies  the  variability  as  deduced  from  pairs  is 
very  slightly  greater  than  (sensibly  equal  to)  that  deduced  from  capsules.  The  mean 
values  of  the  coefficient  of  variation,  as  calculated  from  pairs  and  from  capsules, 
are  in  close  agreement.  The  only  sensible  deviation  is  that  of  the  Quantocks 
series,  and  this  arises  from  the  large  difference  in  the  standard  deviations  of  the 
two  methods.    The  work  has  been  carefully  revised,  but  no  error  discovered. 

Generally  for  the  coefficient  of  variation  we  have — 


Ash  :  17-56 

Spanish  chestnut  .    .    .    .  15  "05 

Beech   1077 

Holly   23-36 


Mean  . 


16-67 


Wild  poppy  16-91 

Shirley  poppy  14-58 


Mean 


15-98 


2  s  2 
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So  that  the  variabiHty  of  both  may  be  expressed  in  round  numbers  as  16.  It  is 
clearly  not  a  constant  for  the  whole  vegetable  kingdom,  but  16  enables  us  to 
form  some  rough  appreciation  of  the  plasticity  of  vegetable  forms,  and  may  be 
compared  with  values  obtained  from  other  series  later. 

Comparing  again  the  ratio  of  the  racial  and  individual  variabilities,  we  see  that 
the  latter  reaches,  on  an  average,  over  85  per  cent,  of  the  former,  the  minimum  value 
being  79  and  the  maximum  91.  It  is  difficult  to  test  this  point  directly,  for  the 
very  simple  reason  that  a  single  plant  will,  as  a  rule,  have  very  few  flowers ;  and  upon 
small  numbers  it  is  impossible  to  test  satisfactorily  the  full  variability  of  the  plant.  The 
probable  error  becomes  enormous  if  we  have  only  four  or  five  capsules  to  determine 
the  standard  deviation  from.  The  method  adopted  has  been  that  which  arises  in 
the  ■  treatment  of  parental  inheritance  ;  all  the  offspring  of  parents  of  a  given 
stature  are  clubbed  together  and  form  an  array.  This  array  has  a  certain  varia- 
bility, and  this  variability  is  taken  to  be  that  of  a  single  family  of  offspring  having 
parents  of  the  given  stature.  It  would  clearly  be  impossible  to  find  the  variability 
inside  the  family  from  numbers  such  as  occur  in  a  single  human  family.  To  some 
extent  we  can  verify  the  assumptions  thus  made.  Thus  I  took  the  wild  poppy  with 
the  largest  number  of  capsules  I  had  come  across,  and  determined  their  variability  ; 
there  were  36  such  capsules,  and  the  standard  deviation  was  1"4519.  I  then  took 
two  of  the  most  jDrolific  Shirley  poppies  which  had  126  and  85  capsules  on  them, 
and  their  variabilities  were  given  by  standard  deviations  of  1"3290  and  1*5889,  i.e., 
a  mean  of  1"4589.  These  are  actually  in  excess  of  the  standard  deviations  of  the 
arrays,  namely,  1*3661  and  1*4019  of  the  two  series  of  poppies  !  I  think  we  may 
accordingly  take  the  latter  as  a  reasonable  measure  of  the  possible  variation  of  the 
individual. 


Table  IX.— Wild  Poppy  (Chilterns'  Top)  Series  (i.). 

Number  of  Stigviatic  Bands  on  First  Capsule. 


5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

Totals. 

5 

0 

8 

4 

12 

6 

8 

38 

46 

23 

11 

5 

2 

133 

7 

4 

46 

184 

163 

146 

78 

31 

8 

1 

661 

8 

23 

163 

390 

398 

279 

111 

75 

32 

4 

1475 

9 

11 

146 

398 

554 

415 

250 

161 

68 

9 

2017 

10 

5 

78 

279 

415 

514 

520 

240 

112 

27 

10 

2200 

11 

2 

31 

111 

250 

520 

770 

366 

160 

52 

9 

2271 

12 

8 

75 

161 

240 

366 

252 

178 

35 

12 

1327 

13 

1 

32 

68 

112 

160 

178 

92 

23 

11 

677 

14 

4 

9 

27 

52 

35 

23 

28 

12 

190 

15 

_ 

5 

10 

9 

12 

11 

12 

4 

63 

Totals 

12 

133 

661 

1475 

2017 

2200 

2271 

1327 

677 

190 

63 

11026 

Si 
s 

&2 


Si 


•  "Si 
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Table  X.— Wild  Poppy  (Chilterns'  Base)  Series  (ii.). 


Number  of  Stigmatic  Bands  on  First  Capsule. 


er  of  Stigmatic  Bands  on  Second  Capsule. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

Totals. 

5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 

0 
1 

2 

1 

2 

1 

4 

1 

4 
40 
25 
19 
10 
9 
1 
1 

2 
40 
184 
289 
228 
145 
48 
20 
4 
1 

1 
25 
289 
822 
964 
563 
283 
105 
20 
8 

2 

19 

228 
964 
1868 
1633 
855 
353 
78 
32 
5 
4 

1 

10 

145 
563 
1633 
2106 
1469 
682 
151 
60 
17 
8 

4 
9 
48 
283 
855 
1469 
1220 
585 
154 
63 
20 
13 

1 

20 
105 
353 
682 
585 
336 
134 
54 
14 
14 

1 
4 
20 
78 
151 
154 
134 
52 
44 
17 
7 

1 

8 
32 
60 
63 
54 
44 
26 
8 
1 

5 
17 
20 
14 
17 

I 

1 

4 
8 
13 
14 
7 
1 
1 
0 

11 
110 
961 
3080 
6041 
6845 
4723 
2298 
662 
297 
84 
48 

Numb 

Totals 

11 

110 

961 

3080 

6041 

6845 

4723 

! 

2298 

662 

297 

!  84 

i 

48 

25160 

1 

Table  XL— Wild  Poppies  (Chilterus'  Base)  (ii.)'''^ 
(No  repetition  of  any  individual  capsule.) 


Number  oj  Stigmatic  Bands  on  First  Capsule. 


5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

Totals. 

5 
6 
7 
8 
9 

10 
11 

12 
13 
14 
15 
16 

1 

2 
5 
1 
1 
0 
3 

5 
24 
28 
19 
13 
1 
1 

1 

28 
80 
79 
73 
24 
8 
2 

1 
1 
19 
79 
202 
140 
74 
25 
5 
4 

0 
13 
73 
140 
198 
127 
50 
10 
7 
1 

- 

3 
1 

24 
74 
127 
112 
59 
10 
6 
0 
2 

1 
8 
25 
50 
59 
32 
13 
5 
2 

0 

2 
5 
10 
10 
13 
10 
3 
1 
0 

4 
7 
6 
5 
3 
0 
0 
0 

1 

0 
2 

1 

0 
0 
1 

2 
0 
0 
0 
1 
0 

1 

12 
91 
295 
550 
619 
418 
195 
54 
25 
5 
3 

Totals 

1 

12 

91 

295 

550 

619 

418 

195 

54 

25 

5 

3 

1  2268 
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Table  XII. — Wild  Poppy  (Quantocks)  Series  (iii.). 

Number  of  Stigmatic  Bands  on  First  Capsule. 


5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

Totals. 

5 

14 

41 

31 

22 

16 

7 

4 

0 

1 

0 

136 

6 

41 

170 

299 

202 

93 

40 

9 

0 

9 

0 

863 

7 

31 

299 

922 

895 

375 

113 

22 

2 

7 

0 

2666 

8 

22 

202 

895 

1716 

1312 

687 

178 

22 

11 

0 

5045 

9 

16 

93 

375 

1312 

1590 

1275 

337 

43 

19 

0 

5060 

10 

7 

40 

113 

687 

1275 

1234 

406 

92 

52 

0 

3906 

11 

4 

9 

22 

178 

337 

406 

308 

138 

51 

1 

1454 

12 

0 

0 

2 

22 

43 

92 

138 

102 

37 

1 

437 

13 

1 

9 

7 

11 

19 

52 

51 

37 

26 

4 

217 

14 

0 

0 

0 

0 

0 

0 

1 

1 

4 

0 

6 

Totals . . . 

136 

863 

2666 

5045 

5060 

3906 

1454 

437 

217 

6 

19790 

Table  XIII. — Shirley  Poppies  (Chelsea)  Series  (iv.). 


Number  of  Stigmatic  Bands  on  First  Capsule. 


6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

17. 

18. 

Totals. 

6 

1 

1 

7 

0 

0 

1 

1 

8 

1 

0 

0 

0 

2 

0 

0 

3 

6 

9 

0 

0 

20 

37 

29 

15 

9 

110 

10 

1 

2 

37 

88 

64 

39 

19 

4 

1 

255 

11 

0 

29 

64 

140 

138 

70 

19 

4 

1 

465 

12 

0 

15 

39 

138 

196 

154 

68 

25 

2 

637 

13 

3 

9 

19 

70 

154 

174 

105 

55 

8 

597 

14 

4 

19 

68 

105 

112 

63 

17 

2 

1 

391 

15 

1 

4 

25 

55 

63 

52 

16 

4 

0 

220 

16 

1 

2 

8 

17 

16 

4 

4 

1 

53 

17 

2 

4 

4 

6 

1 

17 

18 

1 

0 

1 

1 

0 

3 

Totals 

1 

1 

6 

110 

255 

465 

637 

597 

391 

220 

53 

17 

3 

2756 
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(13.)  C.  Nigella  Hispanica. — Besides  the  poppy  capsules,  Professor  Oliver  had 
suggested  to  me  that  it  would  be  possible  to  count  the  segmentation  of  the  seed 
vessel  in  the  cases  of  Nigella  and  Mallow.  I  accordingly  procured  from  Messrs. 
Veitch,  of  Slough,  upwards  of  a  hundred  plants  of  Nigella  Hispanica.  Meanwhile, 
Professor  Oliver  sent  me  a  list  of  some  he  had  counted  himself,  and  pointed  out 
that  there  was  a  substantial  differentiation  between  the  seed  vessels  on  the  main 
stem  and  those  on  the  side  shoots.  It  was  quite  clear  on  examination  that  these 
seed  vessels  were  not  undifferentiated  like  organs,  the  number  of  segments  in  a  seed 
vessel  on  the  side  shoots  being  much  more  like  the  number  on  a  similar  vessel  on  a 
separate  plant  than  like  those  of  a  main-shoot  capsule  on  the  same  plant.  Further, 
although  I  do  not  know  the  history  of  these  special  plants,  they  were  probably  from 
highly  selected  seed  and  in  a  very  unstable  condition.  Whereas  in  the  examination  of 
many  thousand  poppy  capsules  I  only  found  one  remarkable  abnormal  case,"^  abnor- 
malities in  the  case  of  Nigella  Hispanica  abounded  and  rendered  the  counting  of  the 
segmentation  very  difficult.  Double  and  treble  capsules  were  found  in  every  state  of 
union  up  to  complete  fusion,  and  with  every  variety  of  deformity,  such  as  the  tips  of 
one  capsule  growing  through  the  sides  of  a  second  capsule.  Thus  the  material  was 
by  no  means  suited  to  my  present  purpose.  However,  as  I  had  the  material  it 
seemed  worth  while  testing  the  influence  of  differentiation  in  reducing  correlation. 
Accordingly  I  made  no  attempt  to  separate  top  from  side  capsules,!  and  formed  the 
following  table  in  the  manner  sufficiently  indicated  in  the  preceding  pages  : — 


*  A  double  capsule  on  one  stem.  It  contained  only  a  few  seeds.  These  Professor  F.  0.  Oliver  kindly 
sowed  for  me  this  year ;  there  resulted  two  plants  with  a  very  few  poor  flowers,  but  the  abnormality  was 
not  perpetuated,  and  the  capsules  rotted  off  without  giving  seed. 

t  If  the  top  capsules  only  had  been  taken  I  should  often  have  had  only  one  on  a  plant. 
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While  the  modal  value  is  conspicuously  8,  the  mean  value  is  7 '4639.  This  is  due 
to  the  second  modal  value  of  5  on  the  side  shoots.  The  standard  deviation  is  1"4893, 
and  the  correlation  'IB 99.  It  will  he  seen  at  once  how  the  result  fully  confirmed  my 
expectation,  that  the  degree  of  resemblance  between  capsules  on  the  same  plant 
would  be  largely  reduced,  owing  to  the  existence  of  differentiation.  The  actual 
distribution  of  capsides  in  210  plants  was  as  follows  : — - 


Number  of  Segments. 


2. 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

12.!  13. 

14. 

15. 

16. 

17. 

18. 

19. 

20. 

Total. 

Frequency.    .  . 

1 

1 

5 

75 

108 

133 

420 

61 

19 

10 

8  i  2 

1 

0 

0 

1 

0 

0 

0 

1 

'  845 

Thus  on  an  average  there  were  four  seed  vessels  to  a  plant.  The  mean  and  standkrd 
deviation  found  directly  from  the  capsules  are  7'5101  and  1'4559  respectively. 
These  are  in  substantial  agreement  with  the  above  results  found  for  pairs.  They  do 
not  confirm  the  conclusion  based  upon  the  data  for  poppies,  namely  that  if  we  work 
with  pairs  we  shall  raise  the  mean  and  lower  the  variability.  The  fact  is  that  vigour 
seems  to  work  in  opposite  directions  with  this  plant  and  the  poppy  ;  with  the  latter 
it  means  many  capsules  with  a  high  number  of  stigmatic  bands,  but  with  the  former 
it  means  many  side  shoots  with  the  differentiated  capsules  with  few  segmentations. 
Such  capsules  of  course  decrease  the  mean  and  raise  the  variability. 

The  coefficient  of  variation  is  19 "385,  sensibly  greater  than  that  of  the  wild  poppy, 
but  below  that  of  the  holly  leaf  The  standard  deviation  of  an  array  is  1-4622,  or 
the  variabiKty  of  the  individual  98-18  per  cent,  of  that  of  the  race.  The  plant  with 
the  greatest  number  of  seed  vessels  on  had  only  11,  and  these  gave  a  standard 
deviation  of  1-6188,  greater  than  that  of  the  whole  race  !  But  of  course  this  is  subject 
to  a  very  large  probable  error ;  it  suffices  to  emphasise,  however,  how  the  source  of 
variation  is  to  be  sought  within  the  individual,  and  is  not  a  mere  result  of  sexual 
reproduction. 

The  following  tal)le  sums  up  the  results  obtained  for  Nigella  Hispanica  : — 


Number  of 

Mean. 

S.  D. 

Coefficient 
of 

variation. 

S.  D.  of 

Per- 
centage 
variation. 

Correla- 

Plants. 

Capsules. 

Pairs. 

array. 

tion. 

Pairs.  . 
Capsules 

1  210 

845 

3212  1 

7-4639 
7-5101 

1-4893 
1  -  4559 

19-385 

1-4622 

98-18 

•1899 

(14.)  D.  Malloiv  (Malva  Eotundifolia). — I  had  given  up  the  idea  of  the  ordinary 
wild  mallow  {Malva  sylvestris)  owing  to  the  few  plants  to  be  found  in  the  neighbour- 
hood of  Hampden,  as  well  as  on  account  of  the  immense  number  of  seed  vessels  on 
the  plants  to  be  counted.    But  one  day  an  inspection  of  my  potato-patch  revealed 
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the  whole  ground  covered  with  low  plants  of  Malva  Rotundifolia*  On  inquiry  I 
found  that  a  cottage  had  recently  stood  on  the  site,  and  that  the  cottagers  kept  in 
their  herb-bed  a  small  clump  of  this  plant  for  the  preparation  of  a  local  ointment  of 
high  repute.  When  the  cottage  was  removed  the  mallow  had  spread  over  the  whole 
patch,  I  believe  very  little  by  seed,  but  largely  by  stolons.  This  seemed  to  provide 
excellent  material  ready  to  hand,  and  from  it  we  can  draw  some  conclusions  as  to  the 
influence  of  such  a  division  on  the  degree  of  resemblance  of  like  parts.  It  will  be 
remembered  that  Darwin, t  in  experimenting  on  Origanum  vulgare,  found  that 
cross-fertilisation  of  flowers  on  plants  which  had  separate  roots,  but  had  spread  by 
stolons  from  the  same  individual,  presented  no  advantage  over  self-fertilisation,  and 
he  concluded  that  this  arose  from  the  fact  that  the  plants  were  really  one  and  the 
same  individual  so  far  as  such  processes  were  concerned.  If  this  were  so  in  the  case 
of  my  mallow,  we  ought  to  find  a  great  reduction  in  the  relationship  of  undiffer- 
entiated like  organs.  Two  points  struck  me  at  the  outset — the  small  variability  in 
the  segmentation  of  the  seed  vessels  and  the  low  mean  value  (13"7)  compared  with 
that  (15)  given  in  botanical  works.  I  was  able  to  collect  127  plants;  of  these,  76 
had  more  than  26  capsules,  some  many  more,  and  of  these  I  took  26  only ;  51  plants 
had  fewer  than  26  capsules,  and  although  I  took  buds  in  every  stage  of  development 
I  could  not  make  up  my  number  to  this,  often  only  obtaining  10,  or  even  fewer.  As 
the  potatoes  were  being  removed  the  mallows  had  to  be  taken  up  before  the  seeds  were 
ripe,  and  the  seed-vessels  artificially  dried.  The  counting  of  the  segments  proved  a 
very  laborious  task.  To  Miss  M.  Notcutt  I  owe  a  considerable  portion  of  the 
counting.  The  buds  and  small  seed  vessels  requiring  a  lens  or  small  microscope  were 
counted  by  Dr.  Lee,  Mr.  Kadford  Sharpe,  and  myself  I  am  responsible  for  the 
tabulation  of  the  data  and  for  all  the  numerical  calculations. 
The  127  plants  gave  rise  to  57,740  pairs  distributed  thus : — 


Co 


Table  XVI. — Malva  Rotundifolia. 

Number  of  Segvients  in  the  First  Seed-vessel. 


*  Misnamed  "  marsh  mallow  "  throughout  this  district  of  Buckinghamshire, 
t  'Cioss  and  Self-feililisatioii  in  Pl;uits,'  .'^rd  edition,  ]).  94 


10. 

11. 

12. 

13. 

14. 

15. 

16. 

17. 

18. 

19. 

Totals. 

10 

10 

24 

60 

78 

89 

36 

5 

302 

11 

24 

64 

202 

270 

185 

57 

7 

809 

12 

60 

202 

850 

1802 

1905 

637 

95 

1 

5552 

13 

78 

270 

1802 

5036 

6195 

2409 

324 

8 

0 

5 

16127 

14 

89 

185 

1905 

6195 

9760 

4293 

653 

13 

0 

10 

23103 

15 

36 

57 

6.37 

2409 

4293 

2428 

386 

5 

0 

8 

10259 

16 

5 

7 

95 

324 

653 

386 

60 

2 

0 

2 

1534 

17 

1 

8 

13 

5 

2 

29 

18 

0 

0 

0 

0 

0 

19 

5 

10 

8 

2 

25 

Totals 

302 

809 

5552 

16127 

23103 

10259 

1534 

29 

0 

25 

57740 
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The  following  table  gives  the  chief  numerical  results  : — 


Malva  Rotundifolia. 


Number  of 

Mean. 

S.  D. 

Coefficient 
of 

variation. 

S.  D.  of 
array. 

Per- 
centage 
variation. 

Correla- 
tion. 

Plants. 

Capsules. 

Pairs. 

Pairs. 
Capsules 

}■" 

2568 

5774oi 
I 

13-7000  + 
•0139 
13-6632 

1 • 0407  + 
•0098 
1^0652 

7-7961 

1-0232 

98-32 

•1827  + 
[•0129] 

The  actual  distribution  of  frequency  of  the  capsules  was  : — 


No.  of  segments . 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

17. 

18. 

19. 

Total. 

Frequency     .  . 

18 

41 

263 

736 

994 

445 

68 

2 

0 

1 

2568 

These  results  amply  bear  out  my  a  priori  conclusions. 

The  degree  of  resemblance  of  like  parts  has  been  very  largely  reduced.  The 
variabihty  (77961)  is  smaller  than  any  we  have  yet  come  across,  and  the  variability 
of  the  individual  differs  under  2  per  cent,  from  that  of  the  race.  All  these  conclusions 
are  compatible  wdth  the  decreased  individuality  which  may  be  expected  to  exist 
among  the  members  of  a  group  to  a  large  extent  propagated  by  stolons  and  not 
from  seed. 

Although  in  the  majority  of  cases  (76  out  of  127)  I  took  the  same  number  of  seed 
capsules  (26)  from  the  plant,  still  the  influence  of  the  plants  with  fewer  capsules 
is  quite  marked;  we  see  that  weighting  with  the  number  of  pairs  lowers  the 
variation  and  raises  the  mean,  in  other  words,  the  large  vigorous  plants  have  more 
segmentation  and  less  variabihty.  The  distribution  of  segmental  frequency  is  fairly 
uniform,  and  shows  a  marked  mode  at  14  segments,  not  the  15  of  the  usual 
statement. 

Section  III. — Members  of  Whorls. 

(15.)  Woodruff  (Asperula  odorata).— I  must  confess  that  there  is  divergence 
between  the  standpoints  of  the  botanist  and  of  the  statistician.  To  the  latter  a 
character  is  good  or  bad  according  as  it  aftbrds  facilities  for  fairly  easy  measurement 
or  enumeration.  He  has  first  to  seek  such  characters  as  a  sine  qua  non,  and  then 
inquire  how  far  they  occur  in  undifferentiated  like  organs.  On  the  other  hand,  the 
biologist  \y\\\  readily  provide  a  list  of  such  characters  in  plant  or  animal  life,  and  not 
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one  of  them  be  available  for  measurement  or  counting.  I  had  long  fixed  my  eye  on 
the  members  of  the  whorls  of  the  woodruff.  They  were  so  fascinatingly  easy  to 
collect  and  count  ;  they  exhibited  such  a  moderate  amount  of  variation,  and  one  felt 
sure  the  collecting,  counting,  tabulating,  and  calculating  could  all  be  done  in  the 
inside  of  a  week  ;  and  the  impulse  to  such  a  light  task  after  the  elaborate  work 
on  the  earlier  series  was  very  great.  Professor  F.  O.  Oliver  and  others  warned  me 
that  the  members  of  the  whorls  were  differentiated  in  their  origin*  and  also  in  their 
position  on  the  stem,  and  that  such  a  series  was  unsuited  to  illustrate  the  degree  of 
reseinblance  between  undifferentiated  like  organs.  However,  the  task  seemed, 
comparatively  speaking,  so  easy  that  I  felt  I  would  undertake  it,  if  only  to  compare 
with  the  Nigella  Hispanica,  and  ascertain  in  another  case  how  differentiation  does 
weaken  the  degree  of  resemblance  of  like  parts.  I  accordingly  collected  201  single 
sprays  of  woodruff f  well  distributed  along  about  a  mile  of  lane  on  the  bank  of  a 
hedgerow  facing  south.  These  were  gathered  at  Great  Hampden,  Buckinghamshire. 
In  counting  the  members  on  the  whorls  I  soon  found  evidences  of  differentiation  in 
position,  the  whorls  towards  the  top  of  the  spray  having,  as  a  rule,  fewer  members 
than  those  lower  down.  The  following  is  the  table  of  frequency  of  different  pairs  of 
whorls  : — 


8  j 


Table  XVH. — Woodruff.    Great  Hampden. 
Number  of  Members  on  First  JJliorl. 


4. 

5. 

6. 

7. 

8. 

9. 

10. 

1  Totals. 

1 

4 

8 

- 

8 

41 

7 

1 

I  65 

5 

8 

32 

273 

126 

91 

i  534 

6 

41 

273 

2376 

1363 

1150 

41 

4 

5248 

7 

7 

126 

1363 

1216 

953 

34 

3 

3702 

8 

1 

91 

1150 

953 

1284 

54 

2 

3535 

9 

4 

41 

54 

4 

137 

10 

4 

1 

2 

9 

Totals  ... 

65 

534 

5248 

3702 

3535 

137 

9 

13230 

I  now  give  the  numerical  statement  of  the  constants  deduced  from  the  above  table 
and  the  frequency  distribution  of  the  whorls  : — 


*  Some  of  the  members  of  the  whorls  are  true  leaves  and  others  modified  stipules, 
t  They  were  broken  off  as  close  to  the  ground  as  possible ;  they  included  all  parts  branching  off  above 
ground,  but  the  forked  sprays  were  only  few  in  number. 


326 


PROFESSOR  K.  PEARSON  AND  OTHERS  ON 


Woodruff. 


Number  of 

Mean. 

S.  D. 

Coefficient 
of 

variation. 

S.D. 

of  array. 

Percent- 
age 
variation. 

Correla- 
tion. 

Sprays. 

Whorls. 

Pairs . 

Pairs  ... 
Whorls 

1  201 

1465 

1.323o| 

G-7978  + 
•0164 
G-9010 

•9318  + 
•0116 

■9291 

13 •4631 

•9177 

98-49 

•1733  [  + 
•0171] 

The  frequency  distribution  of  the  1465  whorls  was  : — 


Number  of 

4. 

5. 

6. 

7. 

8. 

9. 

10. 

Total. 

Frequency  .    .  . 

6 

40 

537 

410 

455 

16 

1 

1465 

The  double  modes  6  and  8  are  brought  out  by  the  above  distribution,  and  this  is 
evidence  either  of  differentiation  or  of  a  tendency  to  an  even  number  of  members. 
Contrary  to  the  usual  rule,  the  mean  number  of  members  is  reduced  when  we  take 
pairs,  indicating  that  when  we  take  a  spray  with  many  whorls — such  as  occasionally 
arises  from  forking — the  number  of  members  in  the  whorls  are  reduced,  and  since 
the  variation  is  also  very  slightly  increased  (as  in  Nigel/a),  they  are  spread  out  over 
a  greater  range. 

The  great  reduction  in  the  correlation,  the  value  of  which  is  only  •1733,  shows  how 
unsuited  the  material  was  for  the  purposes  of  the  present  investigation.  At  the  same 
time  the  ratio  of  the  individual  to  the  racial  variability  reaches  the  very  high  value 
of  98*5  !  We  have  selected  a  character  to  test  individuality  on,  which  exhibits 
differences  which  are  largely  racial,  and  not  peculiar  to  the  individual  plant. 

Section  IV. — Ferns. 

(16.)  In  choosing  ferns  for  considering  the  resemblance  between  undifferentiated 
like  organs  I  had  in  view  not  only  the  comparative  ease  of  counting  (as  compared, 
for  example,  with  the  segmentation  of  mallow),  but  the  fact  which  students  of  the 
fern  strongly  impressed  upon  me  that  it  was  peculiarly  subject  to  its  environment. 
One  great  authority  went  so  far  as  to  assert  that  the  presence  or  absence  of  sori  on 
an  individual  hartstongue  depended  solely  on  the  environment  and  not  at  all  on  the 
individuality  of  the  plant.  While  hardly  prepared  to  accept  to  the  full  such  a 
statement  (for  I  had  already  learned  in  a  variety  of  types  of  the  existence  of 
individuality  in  a  marked  manner  apart  from  the  influence  of  environment),  I 
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anticipated  in  accordance  with  it  a  fairly  high  correlation  of  undifferentiated  like 
organs,  and  this  has,  indeed,  turned  out  to  be  the  case. 

(17.)  Somersetshire  Hartstongue  (Scolopendrium  vulgare). — I  owe  the  counting  of 
the  sort  on  8  to  12  fronds  of  each  of  101  hartstongue  ferns  to  Miss  Agnes  Fry.  It 
was  not  always  possible  to  obtain  the  full  suggested  number  of  10  fronds.  In  a  few 
cases  Miss  Fry  also  counted  more  than  10.  But  the  ferns  with  fewer  than  10  are 
not  many,  and  the  distribution  of  the  fronds  is  sensibly  identical  with  that  of  the 
pairs  of  fronds.  The  number  of  sori  was  often  very  large,  running  up  to  180  ;  the 
grouping  of  the  leaves  was  accordingly  arranged  in  units  of  10  sori,  and,  to  save 
fractionising  in  the  table,  these  groups  were  taken  0 — 9-5,  9-5 — 19-5,  19-5 — 29-5,  &c. 
The  grouping  on  the  basis  of  Miss  Fry's  record  was  carried  out  by  Mr.  Leslie 
Bramley-Moore.  With  regard  to  the  counting  of  the  sori,  all  the  sterile  as  well  as 
the  fertile  markings  were  included.  Fronds  with  no  markings  were  included  ;  one 
plant,  the  only  one  found,  with  absolutely  no  markings  on  any  of  the  fronds  was 
excluded,  and  to  this  I  shall  recur  immediately.  It  was  difficult  to  procure  ten  perfect 
fronds ;  many  of  those  gathered  were  found  to  be  more  or  less  torn ;  in  many  cases 
the  loss  of  sori  was  easily  ascertained  ;  in  others  an  estimate  had  to  be  formed.  Such 
estimates  were  made,  however,  in  only  forty-four  cases  out  of  more  than  1,000  fronds, 
and  in  the  majority  of  these  cases  it  meant  the  addition  of  very  few  sori,  and  a  possible 
error  in  the  estimate  of  still  fewer.  Sometimes  one  or  two  sori  may  have  been  missed 
at  the  tip,  where  the  small  markings  are  indistinct,  but  the  errors  due  to  this  source 
as  well  as  to  the  occasional  error  of  a  unit  or  two  in  counting  such  a  large  number 
are  well  under  the  unit  of  grouping  of  ten  sori  finally  selected.  The  environment  of 
the  series  was  not  quite  uniform,  some  thirty-three  coming  from  a  stream  bed  and  the 
remainder  from  a  park  glade.  The  plants  varied  considerably  in  age,  but  the  necessity 
of  procuring  eight  to  twelve  fronds  was  probably  effective  in  excluding  any  quite  young 
ferns.  The  absence  or  presence  of  sori  on  the  frond  was  not  directly  associated  with 
its  age  or  size.  Miss  Fry  has  recorded  a  number  of  large  and  medium  fronds  with 
no  sori,  and  some  of  the  fronds  recorded  as  young  have  occasionally  a  very  con- 
siderable number  of  sori  (as  many  as  ninety-five  on  one  at  least).  Still,  a  glance  at 
the  table  on  p.  330  shows  that  the  fronds  with  zero  sori  form  in  same  way  an  anoma- 
lous group  ;  they  give  a  hump  in  the  frequency  distribution  in  a  manner  somewhat 
similar  to  that  due  to  barren  wives  in  fertility-frequency  curves.*  Dr.  Lee  having 
tabulated  the  sori,  and  calculated  the  constants  on  the  basis  of  including  all  fronds 
with  zero  number  of  sori,  I  went  through  Miss  Fry's  data  again  and  excluded,  not 
all  the  entries  in  the  first  row  and  column  of  Miss  Lee's  table,  but  only  such  of 
them  as  arose  from  the  fronds  with  no  sori.  The  result  was  a  great  smoothing  of 
the  table,  a  raising  of  the  mean,  and  a  lowering  of  the  variability  ;  but  no  appreciable 
effect  whatever  tvas  made  on  the  degree  of  resemblance  of  like  organs,  although  I 

*  See  'The  Chances  of  Death  and  other  Studies  in  Evolution,'  vol.  1,  essay  on  "Reproductive 
Selection,"  and  diagram. 
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We  see  from  the  last  table  how  the  "  hump  "  disappears  from  the  frequency  dis- 
tribution when  we  exclude  the  fronds  without  sort.  The  double  mode  indicates  at 
least  considerable  flatness  at  the  top  of  the  distribution,  and  may  possibly  mark 
some  heterogeneity  of  material.  I  am  inclined,  however,  to  think  that  the  great 
variability  (about  forty-two  even  when  the  fronds  without  son  are  excluded)  is  fully 
accounted  for  by  the  sensitiveness  of  the  fern  in  the  m.atter  of  sorj  to  very  slight 
differences  of  environment.  I  look  upon  the  high  resemblance  of  like  parts  here  as 
having  been  intensified  by  this  cause  ;  upon  an  inherent  individuality  we  have  super- 
posed an  individuality  due  partially,  perhaps,  to  age,  but  largely  to  small  differences 
of  immediate  environment.  Although  the  variation  in  the  race  is  so  large,  and  the 
degree  of  individuality  so  great,  the  ratio  of  individual  to  racial  variability  is  still 
78  per  cent.  Thus  while  the  variability  of  the  hartstongue  (as  far  as  sori  is  concerned) 
is  double  that  of  the  very  variable  holly,  and  almost  thrice  that  of  the  poppy,  still 
the  percentage  variability  of  the  two  latter  species  is  to  that  of  the  former  only  as 
about  85  to  78.  Thus  in  the  most  variable  and  most  individual  species  we  have  yet 
come  across,  we  still  find  the  variation  within  the  individual  is  more  than  three-fourths 
of  the  entire  variation  of  the  race.  In  view  of  facts  like  this,  it  seems  impossible  to 
maintain  the  position  taken  up  by  Mr.  Adam  Sedgwick,  that  variation  is  the  outcome 
of  bi-sexual  reproduction.  The  source  of  variation  exists  within  the  individual  and 
is  extensively  active  without  the  occurrence  of  any  form  of  mating  whatever, 

I  place  here  the  table  for  the  distribution  of  soi  i  in  pairs  of  fronds.  The  numbers 
in  brackets  are  those  which  must  replace  the  unbracketed  numbers,  if  fronds  without 
sori  be  excluded. 
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(18.)  B.  Somersetshire  Ceterach  (Ceterach  officinarum).— Some  time  after  I  had 
tried  in  vain  to  count  any  characters  in  bracken,  I  had  a  letter  from  Miss  Agnes  Fry 
saying  that  she  thought  it  just  possible  that  the  lobes  on  the  fronds  of  ceterach 
might  be  counted.  The  chief  difficulty,  of  course,  was  the  indefinite  character  of 
the  lobes  near  the  tip-a  difficulty  which  had  rendered  the  lobes  in  bracken 
impossible.  The  rather  indefinite  tip  is  here  of  greater  consequence  than  m  the 
hartstongue  or  the  chestnut  leaf,  for  the  total  number  of  lobes  is  comparatively 
small.  In  some  samples  Miss  Fry  sent  me,  however,  I  agreed  fairly  closely  with  her 
estimates,  and  although  the  tip  must  form  a  difficulty,*  we  settled  that  an  attempt 
should  be  made  to  include  ceterach  in  the  present  series.  Miss  Fry  accordingly 
counted  9  to  11  fronds  on  each  of  99  plants.  It  was  necessary  to  take  plants  of 
very  different  ages,  and  even  if  ceterach  be  not  so  sensible  to  its  environment  as 
hartstongue,  it  is  quite  possible  that  part  of  the  correlation  observed  is  due  to 
similarity  of  age.  The  absolute  agreement  of  the  result  obtained  with  that  for 
hartstongue  is  one  of  the  most  striking  things  in  the  whole  collection  of  data. 
Considering  the  difficulty  of  the  tip  in  ceterach  and  the  sensibility  of  the  sori  of 
hartstongue  to  environment,  it  may  be  a  chance  agreement,  but  it  is  certainly  one 
that  gives  ground  for  pause,  and  suggests  further  investigations  of  the  degree  of 
resemblance  between  like  organs  in  ferns.  The  tabulation  of  the  data  and  the  calcu  - 
lation  of  the  constants  are  again  due  to  Dr.  Lee. 

Ceterach. 


Number  of 

Mean  No.  of 
lobes. 

S.  D. 

Coeffi- 
cient of 
variation 

S.  D.  of 

array. 

Per- 
centage 
variation. 

Correlation. 

Plants. 

Fronds. 

Pairs. 

99 

999 

9098 

23-1677  ±-0902 

4-2278  ±-0638 

18-2485 

3-2795 

77-57 

•631ir  +  -0128] 

The  distribution  of  frequency  of  the  fronds  is  almost  the  same  as  that  of  the  pairs, 
since  the  attempt  was  made  to  take  10  fronds;  actually  80  plants  had  10,  14  had 
11,  and  5  only  9  fronds. 

Frequency  from  Pairs. 


No.  of  lobes  . 

,3.! 

14. 

15. 

16. 

! 

17. 

18. 

19. 

20. 

21. 

22. 

23. 

24. 

Frequency  . 

55 

73 

146 

229  1 

j 

375 

418 

584 

575 

811 

726 

800 

805 

No.  of  lobes  . 

25. 

26. 

27. 

28. 

29. 

30. 

31. 

32. 

33. 

34. 

35. 

1 

36. 

37. 

38. 

39. 

Total. 

Frequency  . 

863 

679 

668 

386 

288 

215 

184 

91 

I 

37 

45 

27 

0 

9 

0 

9 

9098 

*  Miss  Fry  writes  :  "  It  is  very  difficult  to  keep  to  a  standard  of  counting  for  the  tip,  but  I  have  tried 
to  do  so." 
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It  will  be  seen  at  once  that  while  in  the  case  of  ceterach  the  variability  has  fallen  to 
less  than  half  that  of  the  hartstongue — being  close  to  that  of  the  Nigella  Hispanica 
and  less  than  that  of  holly— still  the  measure  of  the  individuality,  i.e.,  the  degree  of 
sameness  in  like  organs,  is  to  the  limit  of  accuracy  defined  l)y  its  probable  error 
identical  with  that  of  hartstongue.  Considering  the  difficulties  arising  from  environ- 
ment, difference  of  age,  and  from  standards  of  reckoning,  this  agreement  is  remarkable. 
It  seems  to  me  that  the  basis  of  the  identity,  the  bulk  of  the  agreement,  must  have 
other  sources  than  a  balance  between  the  resemblances  produced  in  undift'erentiated 
like  organs  by  such  diverse  causes  as  environment  on  the  one  hand  and  age  on  the 
other  acting  upon  the  individual  plants.  I  hope  that  further  investigations  upon 
ferns  may  be  forthcoming.    Table  XIX.  contains  the  classified  data  for  ceterach. 

Section  V. — Onions  (Allium  cepa),  Great  Hampden. 

(19.)  The  veins  in  the  successive  tunics  of  onions  occurred  to  me  as  a  possible 
character  for  computation,  no  differentiation  of  the  number  of  veins  with  the 
position  of  the  tunic  being  discoverable.  The  number  of  tunics  on  which  the  veins 
were  countable  was  about  six,  but  the  process  of  counting  itself  was  for  more  than 
one  reason  a  most  trying  one.  As  there  were  so  few  "  like  organs  "  to  each  individual, 
it  was  needful  to  deal  with  at  least  200.  I  accordingly  purchased  about  250  English 
onions  ;  these  were  in  three  groups,  and  came  from  as  many  cottage  gardens.  Their 
average  horizontal  diameter  was  about  2  inches,  and  they  appeared  very  uniform  in 
character.  It  is  quite  possible,  however,  that  there  may  be  some  heterogeneity 
introduced  into  the  series  by  this  mixture  of  onions  from  different  sources,  but  I  did 
not  at  the  time  see  my  way  to  obtaining  the  whole  series  from  one  homogeneous 
sowing.*  The  entire  work  of  counting  and  calculating  on  200  of  these  onions  was 
most  valiantly  undertaken  by  Dr.  LEE.f  Each  onion  was  cut  through  horizontally  ; 
the  tunics  then  came  easily  apart,  and  these  were  held  up  to  the  light,  and  the 
number  of  veins,  ranging  from  18  to  54,  then  counted  and  recorded.  The  following 
table  contains  the  chief  numerical  constants  : — 


*  The  onions  were  purchcised  at  an  auction  following  a  harvest  festival ;  the  great  bulk  of  them  had 
formed  a  large  chancel  crucifix  constructed  of  two  different  species  from  one  cottage  garden.  The 
English  onions  from  thi^  crucifix  formed  a  most  homogeneous  group,  having  been  already  selected  for 
uniformity  of  size  and  appearance.  I  supplemented  these  by  the  purchase  of  two  other  smaller 
"offerings,"  closely  resembling  them. 

t  If  any  of  my  readers  wish  to  appreciate  the  purely  phijikal  difficulties  of  the  task,  they  should  try 
the  effect  of  slicing  and  counting  four  or  five  onions,  and  they  will,  I  think,  be  inclined,  as  I  soon  was,  to 
give  up  the  task  in  despair. 
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English  Onion. 


Number  of 

Mean  No. 

S.  D. 

CoefRcient 
of 

variation. 

S.  D.  of 

Percentage 

Correla- 

Plants. 

Tunics. 

Pairs. 

of  veins. 

array. 

variation. 

tion. 

200 

1085 

4924 

29-8725 
±  -1066 

5 • 2060 
±  -0754 

17-4274 

4  - 1222 

79-18 

•6108 
[±  -0128] 

The  actual  number  of  tunics  with  veins  countable  was  made  up  as  follows : 
121  onions  with  6,  44  Avith  5,  34  with  4,  and  1  with  only  3  available  tunics.  It  will  be 
seen  that  the  onion  has  a  degree  of  resemblance  in  its  undifferentiated-like  organs 
closely  resembling  that  of  the  Chelsea  Shirley  poppies,  of  the  Dorsetshire  hollies 
(without  No.  91),  or  the  mixed  Simnish  chestnuts,  and  somewhat  less  than  that  found 
for  the  two  series  of  ferns.  Its  variability  approaches  closely  that  of  the  wild  poppy. 
On  the  whole  it  fits  well  into  our  series  of  results,  the  somewhat  high  value  of  the 
correlation  approaching  those  of  several  sets,  the  homogeneity  of  which  is  not  entirely 
above  suspicion.  The  accompanying  table  gives  the  classified  data  from  which  the 
constants  were  deduced. 

Section  VI.- — Leguminous  Plants. 

(20.)  I  had  at  an  early  stage  considered  that  the  counting  of  seeds  in  the  pods  of 
leguminous  plants  would  be  a  very  easy  manner  of  testing  the  intensity  of  homo- 
typic  correlation.  But  as  the  number  of  seeds  fully  developed  would  depend  on 
the  extent  to  which  the  flower  had  been  fertilised,  it  seemed  to  me,  after  further 
thinking  about  the  problem,  that  unless  I  counted  all  the  abortive  as  well  as  the  fully 
developed  seeds,  I  should  find  the  factor  of  homotyposis  weakened  by  the  external 
chances  upon  which  I  thought  cross-fertilisation  must  depend,  and  which  I  feared  would 
have  nothing  in  the  main  to  do  with  the  individuality  of  the  plant.  I  made  some 
attempts  to  count  all  abortive  as  well  as  fully  developed  seeds  in  certain  species,  but  I 
found  the  task  not  only  very  laborious,  but  my  estimates  doubtful.  Examining  for 
another  purpose  Darwin's  '  Cross-  and  Self-Fertilisation  of  Plants.'  I  was  struck  by 
the  fact  that  he  did  not  liesitate  for  the  purpose  of  comparing  their  vigour  to  count 
the  seeds  of  plants  living  in  the  open  and  all  alike  subject  to  free  visitation  from 
insects  and  bees.  He  distinctly  states  (3rd  Edn.,  p.  115)  that  "  the  difference  in  the 
number  of  the  contained  seeds*  must  depend  u2)on  the  constitution  of  the  plants." 
This  view  of  the  matter  impressed  me  without  entirely  removing  my  earlier  doubts. 
Given  a  number  of  plants  living  under  the  like  conditions,  and  with  a  superabundant 
visitation  from  insects,  there  would  be  ample  and  equal  chance  of  fertilisation  for  all 

*  Darwin  in  all  cases  appears  to  have  calculated  the  good  or  fully  developed  seed. 
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of  them,  and  excess  or  defect  from  the  average  number  of  fertile  seeds  would  depend 
only  on  the  constitution  of  the  individual  plant.  It  would  seem  accordingly  that  in 
the  case  of  cross-fertilisation  ample  visitation  was  a  si7ie  qud  non,  and  this  led  me  to 
select  broom  in  the  first  place. 

Broom  (Cytisus  Scoparius),  Danhy  Dale.— 'We  collected  10  pods  h'om  each  of  120 
broom  bushes.  These  were  growing  within  some  quarter  of  a  mile  of  each  other,  on 
the  roadside  at  Botton,  towards  the  head  of  Danby  Dale,  Yorkshire. 

I  roughly  estimated  that  if  abortive  seeds  were  to  be  included,  16  would  be  about 
the  modal  number  of  ovules  ;  the  actual  average  of  fully  developed  seeds  in  1200  pods 
was  9-6425,  with  a  modal  value  at  9,  so  that  some  6  to  7  seeds  failed  of  fertilisation 
on  the  average  in  each  pod.  The  following  is  the  frequency  distribution  of  the  seeds 
in  the  pods  : — 


No.  of  seeds  . 

1. 

2. 

3. 

4. 

5. 

6. 

^. 

8. 

9. 

10. 

11. 

12. 

13, 

14. 

15. 

]6. 

17. 

18. 

19. 

20. 

Total. 

Frequency  . 

8 

15 

29 

47 

51 

77 

98 

126 

136 

134 

103 

100 

67 

48 

33 

17 

5 

4 

1 

1200 

I  came  across  no  pods  with  an  entire  absence  ot  seeds,  although  such  may  exist.  I 
refer  to  this  because  the  reader  might  think  that  such  were  discarded.  The  distribu- 
tion is  fairly  regular,  but  it  is  clear  that  the  variation  is  very  large,  the  distribution 
being  very  flat  topped.  As  in  nearly  all  the  cases  that  I  have  dealt  with  in  this 
memoir,  it  is  markedly  skew.  I  reserve,  however,  for  the  present  the  full  considera- 
tion of  variation  in  the  vegetable  world,  as  my  data  extend  far  beyond  the  material 
considered  in  this  paper,  which  is  limited  to  the  cases  in  which  the  homotypic  corre- 
lation has  also  been  worked  out^ — a  far  more  laborious  enquiry,  I  would  only  remark 
that  among  plants  and  trees  I  know  of  nothing  approximating  even  to  the  "  normal 
law,"  and  that  in  many  cases  we  appear  to  have  mixtures  of  local  races  hardly  yet 
differentiated  by  the  botanist. 

The  following  table  gives  the  constants  for  broom  : — 


Broom.    Seeds  in  Pods. 


Number  of 

Mean  No. 

S.  D.  of 

Coefficient 
of 

variation. 

S.  D.  of 

Percentage 

Correla- 

Plants. 

Pods. 

Pairs. 

of  seeds. 

seeds. 

array. 

variation. 

tion. 

120 

1200 

10,800 

9-6425 
±  -0691 

3-54655 
±  -0488 

36  -  7804 

3-22595 

90-96 

-4155 
[±  -0161] 

The  actual  distribution  of  pairs  is  given  in  the  table  on  the  following  page.  It 
will  be  seen  that  some  of  the  arrays  are  rather  irregular,  but  in  its  results  I  look 
upon  broom  to  be  as  satisfactory  as  any  material  I  have  dealt  with.  In  the  first 
place,  the  table  fully  bears  out  the  conclusion  drawn  from  the  simple  frequency 
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distribution  of  the  high  degree  of  variability  in  the  broom  pod.  We  have  so  far 
reached  nothing  but  the  sori  on  the  Hartstongue  fern  with  anything  Hke  such  a  high 
coefficient  of  variation.  We  might  expect,  if  Mr.  Adam  Sedgewick's  view  were 
correct  and  great  variation  meant  small  intensity  of  heredity,  that  homotyposis,  of 
which  heredity  is  only  a  special  case,  would  also  be  small  if  variation  were  large. 
But  we  find  nothing  of  the  kind.  The  resemblance  of  undifferentiated  like  organs 
reaches  the  value  -4155,  fully  equal  to  that  of  the  piyiim  of  the  ash,  which  have  less 
than  half  the  variability  of  broom  pods.  This  value  is  close  also  to  the  '4000  given 
by  the  Ancestral  Law  of  Heredity  for  the  degree  of  resemblance  between  brothers. 
In  the  next  place  this  normal  behaviour  as  to  the  degree  of  resemblance  of  like 
organs  is  associated  with  a  variability  in  the  individual  which  amounts  to  upwards  of 
90  per  cent,  of  the  racial  variability.  It  is  impossible  to  form  standard  deviations 
for  groups  of  10,  but  if  the  reader  will  reduce  any  array  of  the  above  table  to  the 
number  10,  rejecting  all  fractions  of  unity,  he  will  obtain  quite  fair  samples  of 
what  I  actually  found  in  the  case  of  individual  pods,  I  think  we  may  feel  fairly 
confident  that  the  variability  of  a  race  is  not  in  inverse  proportion  to  either  its 
heredity  or  its  homotyposis. 

It  will  be  seen  that  broom,  so  far  as  it  goes,  supports  the  view  held  by  Darwin 
that  the  number  of  ripe  seeds  is  a  measure  of  individual  constitution.  At  the  same 
time  the  question  of  self-fertilisation  arises.  Would  completely  self-fertilised  plants 
exhibit  full  homotyposis  ?  May  not  outside  influences — wind,  shaking  due  to  visits 
of  insects  or  other  causes  of  a  random  character — be  also  needful  for  the  ripening  of 
the  seeds  even  in  the  case  of  self-fertilisation  ?  Again,  can  we  always  suppose  that  a 
plentiful  visitation  will  take  place  in  the  case  of  all  cross-fertilised  plants  ?  I  must 
confess  that  the  value  for  the  homotyposis  found  in  the  case  of  broom  did  not  suffice 
to  remove  all  my  doubts  as  to  any  character  depending  on  fertilisation  being  a 
suitable  one  for  the  determination  of  the  intensity  of  homotyposis.  The  subject, 
however,  is  one  of  such  great  interest  that  it  deserves  an  independent  and  fuller 
treatment  than  can  be  provided  here."* 

V.  Cross-homotypic  Correlation. 

(21.)  Mushroom  (Agaricus  campestris). — A  more  complete  study  was  now  made  of 
two  species — the  mushroom  and  the  ivy.  We  found  some  difficulty  in  discovering 
two  easily  measurable  or  countable  characters  in  one  organ,  neither  of  which  was 
largely  influenced  by  the  growth  of  the  organ  or  the  age  of  the  organism.  In  select- 
ing an  organ  in  the  mushroom,  I  was  guided  by  the  desire  to  take  a  simple  organism, 
and  an  organ  upon  which  fertilisation  had  no  influence.  The  gill  of  the  mushroom 
seemed  to  satisfy  to  some  extent  these  conditions.  Had  we  been  able  to  grow  our 
own  mushrooms  we  might  have  succeeded  in  taking  them  all  at  the  same  stage  of 

*  This  has  been  to  some  extent  provided  in  an  Appendix  to  this  memoir,  added  since  its  completion, 
VOL.   CXCVII.  A,  2  X 
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development,  and  our  results  would  in  this  case  have  been  more  homogeneous  and 
thus  more  satisfactory.  But  simply  gathering  them  m  the  fields  we  could  only  make 
a  very  rough  approximation  to  uniformity  in  age  or  development.  We  aimed  at  a 
condition  in  which  the  underskin  had  disappeared  and  the  gills  had  lost  all  signs  of  a 
crease  or  notch.  Dr.  Lee  and  I  collected  more  than  100  mushrooms  in  the  Great 
and  Little  Hampdens.  With  the  assistance  of  my  wife  we  cut  10  gills  out  of  each 
of  these,  distributed,  as  far  as  the  condition  of  the  mushroom  would  allow,  uniformly 
round  the  axis.  These  gills  were  then  placed  on  ruled  paper,  the  horizontal  lines 
of  which  served  to  mark  the  maximum  length  of  the  gill.  This  was  marked  by 
two  fine  needle-pricks,  one  at  either  end.  Two  further  needle-pricks  were  now  made 
at  the  points  farthest  removed  from  this  length  line,  one  on  the  upper  and  one  on 
the  lower  contour  of  the  gill.  Thus  by  drawing  through  the  four  needle  points  lines 
parallel  and  perpendicular  to  the  paper-ruling  we  obtained  a  rectangle  approxi- 
mately circumscribing  the  gill.  The  length  and  breadth  of  this  rectangle  are 
what  I  shall  term  the  length  and  breadth  of  the  gill.  At  first  I  endeavoured 
to  take  these  measurements  by  the  use  of  a  micrometer  microscope,  but  this 
method  was  very  laborious,  and  gave  an  apjDarent  exactness  wholly  out  of  keeping 
with  the  somewhat  rough  and  ready  manipulation  of  the  gill.  Even  when 
the  gill  itself  was  placed  under  the  microscope  the  gill  length  could  only  be  placed 
parallel  to  the  spider  wires  by  a  somewhat  vague  appreciation,  and  the  uj)per 
boundary  of  the  gill  was  under  the  lens  rather  less  definite  than  without  it.  Accord- 
ingly we  resorted  to  the  needle-pricks  and  the  ruling  by  fine  lines  of  the  above- 
mentioned  rectangle.  I  determined,  after  various  trials  with  engraved  glass  scales, 
&c.,  the  length  and  breadth  of  these  rectangles  by  the  use  of  a  pair  of  projDortional 
compasses,  set  to  multiply  by  ten,  and  an  ordinary  millimetre  scale.  I  was  thus  able 
to  get  readings  to  "01  of  a  millimetre,  which  agreed  well  on  rejDetition.  As  the 
millimetre  itself  was  afterwards  adopted  as  the  unit  of  grouping,  I  do  not  think  the 
errors  of  manipulation  will  seriously  affect  our  results.  The  needle-prick  method 
enabled  us  to  deal  with  the  mushrooms  quite  fresh  and  reserve  the  measuring  till 
another  season.*  Dr.  Lee  found  what  we  may  term  the  gill-index,  i.e.,  the  ratio  of 
length  to  breadth  for  the  1070  gills,  by  aid  of  a  Fuller's  slide  rule.  We  had  thus  three 
characters  to  deal  with — length,  breadth,  and  index.  Of  these  we  considered  that 
the  first  two  would  uiidoubtedly  be  largely  influenced  by  the  stage  of  development  of 
the  mushroom,  but  we  hoped  that  the  index  (like,  for  example,  the  cephalic  index  of 
children  over  two  years  of  age)  would  not  be  nmch  influenced  by  the  growth  factor. 
On  the  other  hand,  the  correlation  of  lengths  of  gills  or  the  correlation  of  breadths  ot 
gills  from  the  same  mushroom  would  of  course  be  influenced  by  growth  as  well  as 

*  In  all  such  systems  of  mass-measurement,  the  method  adopted  has  to  be  chosen  with  due  regard  to 
time  and  eyesight.  The  actual  measurement  of  the  gills  occupied  me  about  ten  full  days,  the  microscopic 
method  would  have  taken  about  four  times  the  time,  even  if  my  eyes  would  allow  of  such  continuous 
working  and  the  mushrooms  could  have  been  preserved  fresh. 
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by  individuality.  Thus  we  should  find  the  degree  of  resemblance  between  two  gills  of 
the  same  mushroom,  when  measured  by  length  or  l^readth,  to  be  influenced  not  only 
by  the  individuality  of  the  mushroom,  but  by  the  stage  of  its  growth,  and 
accordingly  likely  to  exceed,  and  possibly  considerably  exceed,  the  value  about  "5, 
which  we  have  determined  in  several  other  cases/"' 

In  order  to  make  a  more  complete  study  of  the  influence  of  stages  of  growth  on 
the  degree  of  resemblance  of  undiff'erentiated  like  organs.  Dr.  Lee  with  her 
characteristic  energy  undertook  the  five  needful  tables. 

They  are  the  following  - 

(i.)  Organic  correlation  of  the  length  and  breadth  of  the  same  gill, 
(ii.)  Homotypic  correlation  of  the  lengths  of  two  gills  of  the  same  mushroom, 
(iii.)  Homotypic  correlation  of  the  breadths  of  two  gills  of  the  same  mushroom, 
(iv.)  Cross  homotypic  correlation  of  the  length  of  one,  and  the  breadth  of  a  second 

gill  of  the  same  mushroom, 
(v.)  Homotypic  correlation  of  the  indices  of  two  gills  of  the  same  mushroom. 

Of  these  tables  the  cross  homotypic  correlation  between  length  and  breadth  is  a 
very  laborious  business,  offering  no  means  of  readily  testing  the  accuracy  of  the 
entries  until  the  whole  table  is  completed  and  its  columns  and  rows  added  up. 
Ultimately  the  frequency  distributions  (as  given  by  row  or  column  marked  totals) 
for  the  lengths  and  breadths  in  tables  (ii.),  (iii.),  and  (iv.),  ought  to  he  n  —  I  times  the 
frequency  distributions  of  lengths  and  breadths  as  given  by  the  like  row  and  column 
in  the  organic  correlation  table  (i.),  11  being  the  number  of  organs  measured  in  each 
individual ;  in  our  case  this  is  ten. 

I  begin  by  considering  the  results  obtained  for  the  degree  of  resemblance  of  gill- 
indices,  i.e.,  Table  (v.)  referred  to  above.    The  following  were  the  results  obtained  : — 


Mushroom.    Gill  Indices. 


Number  of 

Mean  index. 

S.D. 

Coefficient 
of 

variation. 

S.  D.  of 

array. 

Percentage 
variation. 

Correlation. 

Mushrooms. 

Gills. 

Pairs. 

107 

1070 

9630 

2-8695 
±  -0108 

-5246 
±  -0076 

18-2819 

■4385 

83-58 

-5490 
[±  -0144] 

The  frequency  distribution  of  the  gills  was  the  following  :  — 

*  The  absurdity  ok"  disregarding  the  growth-factor  may  be  realised  if  we  consider  how  much  the  degree 
of  resemblance  in  brethren  would  Ije  increased  if  we  took  pairs  of  brothers,  differing,  say,  by  less  than  two 
years,  and  these  pairs  were  scattered  at  all  ages  from  four  to  twenty  ! 

2x2 
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Gill  Index. 
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CO 
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31 
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8 

3 

3 

4 
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0 

3 

1 

2 
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1 

0 

1 

0 

1 

0 

1 

1070 

This  distriljution  is  rather  irregular  and  may  suggest  heterogeneity  ;  the  tail  is 
somewhat  inordinately  extended.  But  the  numerical  constants  given  in  the  table 
above  are  quite  comparable  with  the  best  of  our  other  series,  and  are  rather  better 
than  I  had  anticipated  a  'priori,  for  I  expected  the  diverse  stages  of  growth  would 
influence  the  index  as  well  as  the  absolute  measures.  It  would  be  of  great  interest 
to  repeat  the  series  on  an  artificially  grown  bed  of  mushrooms,  taking  each  at  the 
same  stage. 

It  will  be  seen  that  the  mushroom,  with  degree  of  resemblance  of  like  organs 
aljout  that  of  the  Spanish  chestnut,  has  a  variability  comparable  with  that  of  the 
ash.  The  variation  of  the  individual  is  even  in  such  a  comparatively  simple  organism 
more  than  80  p.c.  of  that  of  the  race.  It  will  thus  be  clear  that  the  mushroom 
takes,  so  far  as  our  present  investigation  is  concerned,  no  special  place  apart  in  the 
vegetable  world,  but  quantitatively  is  comparable  with  highly  developed  organism^ 
like  trees.  The  accompanying  table,  Table  XXII.,  gives  the  data  for  the  measurement 
of  the  gills. 

I  now  turn  to  the  relationships  of  the  absolute  measurements,  which,  as  I  have 
noted,  we  have  good  grounds  for  believing  are  much  affected  by  growth. 

The  numerical  constants  are  summed  up  in  the  following  table,  the  dimensions 
being  given  in  millimetres  : — 


Table  XXII — Mushroom  Gill-indices. 

Imlex  of  Fird  Gill. 


1-6495 
1-6495 
1-7495 
1-1 

1-  9495 

2-  0495 
2-1495 
2-2496 

i-3495 
2-4495 
2-6495 

2-  6495 
!  2-7495 

/  3-8495  j 
I  2-9496 
3-0496 
3-1495 

3-  2495  ' 
>  3-3495 

I  3-4496 
^1  3-5495 
,  3-6495 
;  3-7495 

3-  8495 
I  3-9496 
j  4-0495 

4-  1496 
:  4-2495 
r 4-3495 
j  4-4495 

4-  5495 
4-6495 
4 -7495 
4-8495 

4-  9496 

5-  0495 
-1495 

I  5-2495 
3-3496 
5-4495  ■ 
5-5495 

'  5-6495 


1-6495.    1-7496.  1-8495 


■1495.    3-240.5.    2-3496.  2-4495, 


i  Totals., 


0 
2 
10 
9 
16 
20 
20 
36 
35 
26 
29 
8 
16 
12 
7 
6 
2 
3 
2 
4 
2 


1 
6 
G 
11 
26 
35 
40 
42 
28 
44 
28 
32 
16 
8 
6 
7 
6 
1 
3 
3 
0 
3 
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75 
62 
79 
38 
46 
27 
13 
13 
7 
12 


72 
102 
124 
124 
104 
90 
47 
50 
32 
17 
15 
11 
9 
5 
3 
1 
4 


38 
47 
73 
124 
138 
100 
84 
60 
42 
36 
17 
10 
4 
11 
1 
2 
4 
7 
0 


1 
0 
0 
2 
3 
6 
16 
32 
44 
62 
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96 
78 
64 
69 
29 
23 
18 
24 
3 
13 
1 
3 
0 
1 
0 
1 
2 
0 
I 
0 

1 

0 
0 
0 
0 

0 

1 


0 
0 

1 

12 
16 
29 
79 
90 
84 
96 
98 
74 
63 
60 
26 
34 
16 
23 
9 
9 
3 
3 
0 


47 
50 
78 
74 
68 
66 
50 
27 
43 

44 
10 
13 
5 
2 
0 
2 
0 
0 
1 
0 
3 
0 
1 
0 
0 
0 
0 
0 
1 


0 
3 
2 
5 
6 

16 
46 
60 
42 
54 
63 
66 
44 
65 
32 
25 
20 
29 
16 
10 

6 

7 

2 

4 

0 

0 

3 

0 

0 

0 

0 

0 

0 

0 

0 


0 
0 
0 
2 
7 
6 
27 
32 
36 
69 
60 
50 
65 
48 
31 
40 
26 
22 
12 


13 
17 
17 
29 
26 
27 
32 
31 
18 
20 
23 
20 
13 
21 
10 


0 
1 
0 
2 
1 
7 
13 
15 
10 
23 
34 
43 
26 
40 
20 
28 
22 
42 
13 
12 
9 
6 
0 
4 
0 
4 
2 
0 
3 
1 
3 
0 
1 
0 
1 
0 

1 

0 

1 


1 
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3 
4 

3  ; 
4 
7 
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63 

- 

- 
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— 

— 
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— 
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— 
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1 

— 

— 

— 

— 

— 

— 

_ 
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— 

— 

— 

- 

- 

0 

- 

— 
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— 

— 

— 
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4 

0 

0 

0 

0 

0 

— 
0 
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0 
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0 
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1_ 
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0 

0 

0 

0 

0 

0 

0 

3 
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0 

I 

0 

1 

0 

0 

0 

0 
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1 
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0 

0 

0 
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0 

0 
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2 

0 

3 
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0 
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0 
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1 

0 

2 

0 

2 

0 

0 

0 

2 

81 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

9 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

72 

0 

0 

0 

0 

0 

0 

0 

0 

0 
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0 
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0 

0 

0 
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9630 

N.B.  1-5495  denotes  tlie  gi'oiip  formed  of  all  indices  fioni  1-4995  to  1-5995,  or  since  the  ratio,  length/breadth,  was  formed  to  three  decimal  places,  all  values  of  the  index  from  1  -500  to  1  -599  inclusive. 
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Absolute  Dimensions  of  Mushroom  Gills. 


Character. 

Mean. 

S.  D. 

Coefficient 
of  variation. 

S.  D.  of 
array. 

Percentage 
variation. 

Direct  homo- 
typic  correlation. 

Length.  . 
Breadth  . 

19-1847  +  -0831 
6-8529  ±  -0331 

4-0282  +  -0587 
1-6047  ±  -0234 

20-9969 
23-4164 

2-0510 
1-0859 

50-92 
67-67 

-8607  [±  -0053] 
-7363  [±  -0094] 

Cross-correlations. 


Length  and  breadth,  organic 
Length  and  bi'eadth,  homotypic 


•7000  ±  -0105 
■6275[±  •0125] 


The  results  here  given  seem  to  be  of  considerable  suggestiveness.  While  the  two 
means  and  standard  deviations  dilier  very  considerably,  the  coefficients  of  variation 
for  the  length  and  breadth  of  the  gill  are  approximately  equal,  and  in  addition 
;very  high.  This  is  precisely  what  we  might  expect  when  dealing  with  a  series 
of  organisms  in  various  stages  of  growth.  Further,  the  variability  of  the  individual 
as  compared  with  that  of  tlie  race  is  reduced  immensely  below  the  80  to  90  per  cent, 
with  which  our  previous  investigations  have  made  us  familiar.  This  might  again  be 
anticipated  as  a  result  of  heterogeneity  in  the  stage  of  growth.  The  organic  correla- 
tion of  length  and  breadth,  as  well  as  the  homotypic  correlations  of  the  same  two 
characters,  both  direct  and  cross,  are  all  very  high,  and  the  latter  are  much  beyond 
what  we  might  legitimately  j)ut  to  the  credit  of  the  pure  homotyposis  factor.  It  seems, 
therefore,  somewhat  hazardous  to  consider  how  far  they  satisfy  the  relationship  sug- 
gested in  the  introductory  part  of  this  paper.  The  direct  homotypic  correlations  for 
length  and  breadth  are  not  approximately  equal  as  we  might  exjject ;  this  suggests 
that  growth  may  be  a  somewhat  more  marked  factor  in  the  length  than  the  breadth 
of  the  gill.  If  we  take  the  product  of  direct  homotypic  correlation  with  the  organic 
correlation,  this  ought,  on  our  hypothesis,  to  give  us  roughly  the  cross-homotyjaic 
correlation.    We  have  the  following  results  : — 

Product  of  direct  homotypic  length  correlation  with  organic  length - 

breadth  correlation  =  -6025. 

Product  of  direct  homotypic  breadth  correlation  with  organic  length- 
breadth  correlation  =  '5154. 

Product  of  meai]  of  direct  homotypic  breadth  and  length  correlation  with 

organic  length-breadth  correlation  .    ,  =  -5590. 


These  numbers  have  to  be  compared  with  the  cross  homotypic  length-breadth 
correlation,  i.e.,  with  "6275. 
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It  will  be  seen  at  once  that  they  are  sensibly  too  small  (say,  '56  as  compared  with 
■63).  But  having  recognised  the  influence  of  heterogeneity  in  the  growth  stages 
of  our  material,  the  divergence  is,  I  take  it,  of  a  magnitude  rather  to  confirm  than 
confute  the  hypothesis  as  to  the  relationship  of  direct  and  cross  homotypic  correla- 
tions. It  must  be  remembered  that  we  are  dealing  with  average  results.  Our  main 
proposition  is  that  heredity  is  not  a  factor  of  life  j^eculiar  to  sexual  reproduction,  but 
merely  a  phase  of  the  larger  factor,  which  we  have  termed  homotyposis,  or  the 
tendency  of  the  individual  to  put  forth  undifferentiated  like  organs  with  a  certain 
degree  of  resemblance.  Our  subsidiary  proposition  is  :  that  if  the  direct  homotypic 
correlation  oscillates  about  a  certain  mean  value,  and  the  cross  homotypic  correlation 
be  the  product  of  organic  and  direct  homotypic  correlations,  then  we  should  expect 
to  find  the  average  degree  of  resemblance  of  l)rethren  equal  to  the  average  degree 
of  resemblance  of  undifferentiated  like  organs  in  the  individual. 

All  then  I  think  we  can  safely  say  for  the  gills  of  mushrooms  is  that  the  relation  of 
the  direct  and  cross  homotypic  correlations  is  by  no  means  such  that  it  condemns 
our  hypothesis,  or  enforces  us  to  reject  our  subsidiary  jjroposition.  The  difficulty  lies  in 
finding  adult  organisms  with  undifferentiated  like  organs  with  two  characters  in 
sufiicient  quantity,  easily  counted  or  measured,  uj^on  which  the  hypothesis  can  be  tested. 
The  investigation  of  the  length  and  breadth  of  ivy  leaves,  to  which  we  shall  soon  turn, 
suffers  to  some  extent  from  the  same  defects  as  that  for  the  gills  of  mushrooms. 

Tables  XXIII.-XXVI,  contain  the  length -breadth  data  for  the  mushrooms.  The 
frequency  distributions  for  length  and  breadth  are  given  in  the  last  row  and  column 
of  Table  XXY.  above,  and  exhibit  in  their  irregularity  something  of  the  hetero- 
geneity of  growth  to  which  I  have  referred.  Of  the  two  distributions,  I  consider 
that  for  the  breadth  as  the  more  irregular  and  consequently  the  less  satisfactory.  It 
is,  of  course,  harder  to  determine  a  definite  breadth  for  the  gill  than  a  definite 
length,  and  I  should  be  well  content  to  compare  the  product  of  the  homotyj)ic  length 
correlation  and  the  organic  length-breadth  correlation,  i.e.,  "6025,  with  the  cross 
homotypic  length-breadth  correlation,  i.e.,  "6275,  as  the  best  basis  for  our  subsidiary 
proposition  available  from  these  measurements  on  mushrooms. 

(22.)  Wild  Ivy  (Hedera  Helix). — ^This  series  was  originally  undertaken  by  Dr. 
E.  Warren,  his  object  being  to  measure  the  degree  of  resemblance  between  ivy 
leaves  on  the  same  plant,  using  as  his  character  the  index,  or  ratio  of  length  to 
maximum  breadth.  It  was  hoped  that  in  this  manner,  the  growth  factor  might  be 
fairly  well  eliminated. 

The  figures  (page  348)  indicate  how  the  breadth  and  length  were  determined  by 
Dr.  Warren  in  leaves  of  somewhat  different  shape. 

It  will  be  seen  that,  as  in  the  case  of  the  gill  of  the  mushroom,  the  breadth  is  taken 
as  the  maximum  breadth  between  tangents  to  the  contour  parallel  to  the  length, 
and  these  tangents  do  not  necessarily  go  through  points  like  a,  a.  Twenty-five  leaves 
were  taken  from  each  of  42  plants  by  Dr.  Warren,  in  the  neighbourhood  of  Canterbury, 
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and  25  leaves  from  each  of  58  plants  by  me,  in  the  neighbourhood  of  Great  Hampden 
in  Buckinghamshire.  Care  was  taken  to  avoid  the  reproductive  shoots  with  their 
larger  uncut  leaves.  The  Hampden  plants  were,  I  feel  certain,  a  fairly  homogeneous 
group,  the  plants  were  young  runners  up  the  trees,  only  a  few  feet  in  length.  The 
Canterbury  plants  appear  to  have  had  a  somewhat  larger  leaf,  although  in  themselves 
an  apparently  homogeneous  group. 


It  would  undoubtedly  have  been  better  to  have  had  the  leaves  collected  by  one 
person  from  one  district  ;  but  until  the  leaves  had  all  been  measured  and  their 
indices  found,  I  did  not  know  of  this  difference  between  the  two  groups.  It  was  very 
little  noticeable  in  the  table  of  index  distribution,  but  became  manifest  to  me  when 
finding  the  homotypic  correlation  table  of  the  absolute  lengths,  which  I  did  in  two 
sections,  one  for  either  group.  The  very  considerable  labour  involved  in  measuring 
and  determining  the  indices  of  2500  leaves  precluded  a  repetition  of  the  work  on 
more  homogeneous  material.  The  whole  of  this  labour  was  undertaken  by  Dr. 
Warren,  who  further  formed  the  table  and  determined  the  correlation  coefficient.* 

I  give  in  the  following  scheme  the  frequency  distribution  of  the  indices  to  show 
its  uniformity. 


Index. 

1  . 

03  03 
l>J  TO 

•395- 
■495. 

1  . 

o  o> 
^  ira 

1  . 

lO  X-O 
03  05 

ira  CO 
249 

■695- 
■795. 

I  ^795- 
■895. 

•895- 
•995. 

•995- 
1-095. 

1  095- 
1-195. 

1-195- 
1-295. 

1  . 

to  »o 
a-.  03 

0-1  C3 
r-<  rH 

1  . 

U5  1/3 
03  03 
CO  ■* 

1—1  1—1 

1  -495- 
1^595. 

1  . 
Id 

OS  Oi 
lO  CO 

rH  r-l 

Total. 

Frequency 

2 

10 

38 

674 

738 

436 

207 

86 

32 

16 

8 

3 

1 

2500 

*  After  measuring  the  gills  of  only  1070  mushrooms,  I  very  fully  appreciate  Dr.  Warren's  task  on 
the  2.500  ivy  leaves  ! 


HOMOTYPOSIS  IN  THE  VEGETABLE  KINGDOM.  349 

Of  course  the  groupings  here  are  large,  but  the  distribution  notwithstanding  the 
mixture  of  material  seems  more  regular  than  in  some  of  our  other  series. 
The  following  scheme  gives  the  constants  in  the  usual  manner  : 


Leaf  Index  of  Wild  Ivy. 


Number  of 

Mean 

S.  D. 

Coefficient 
of 

variation. 

S.  D.  of 

Percentage 

Correlation. 

Plants. 

Leaves. 

Pairs. 

index. 

array. 

variation. 

100 

2500 

60,000 

•8473 
±  -0020 

•1506 
±  -0014 

17-7735 

•1449 

96^21 

•2726 
[±  -0125] 

It  will  be  observed  that  the  correlation  is  lower  than  we  might  have  expected, 
although  the  mean  of  the  index  correlations  for  mushroom  gills  and  ivy  leaves,  i.e.^ 
^  ('5490  +  "2726)  ="4108,  is  almost  identical  with  the  value  "4  given  for  brothers  by 
the  law  of  ancestral  heredity.  The  low  value  of  the  correlation  gives  of  course  a 
high  value  to  the  percentage  variation,  the  variation  of  the  individual  plant  being 
within  4  per  cent,  of  the  racial  variation.  The  variability  of  the  ivy  leaf,  as  judged 
by  this  character,  has  not  the  intensity  which  is  popularly  associated  with  it ;  it  is 
sensibly  less  than  that  of  the  mushroom  gill,  and  about  equal  to  that  of  the  tunics 
of  the  onion. 

I  now  joass  to  the  absolute  measurements  of  the  leaf. 

Dr.  Warren  being  much  pressed  with  other  work,  kindly  placed  all  his  measure- 
ments at  my  disposal,  and  I  proceeded  to  draw  up  the  same  four  tables  as  in  the 
case  of  the  mushroom  gills.  The  work  here  was,  however,  much  more  laborious  as  I 
had  to  deal  with  25  leaves  instead  of  10  gills,  and  this  involved  300  pairs  for  each 
plant,  instead  of  only  45  !  In  the  course  of  a  fortnight's  work  I  had  completed  the 
length-breadth  organic  correlation,  the  length-length  homotypic  correlation,  and  about 
a  third  of  the  length-breadth  homotypic  correlation.  At  this  point  Dr.  Lee  took  the 
work  olf  my  hands  and  finished  the  last  table  and  the  breadth-breadth  homotypic 
correlation.  So  that  the  results  are  again  the  product  of  co-operation.'*  I  give 
below  the  data  arranged  as  in  the  case  of  the  mushroom  gills.  The  dimensions  are 
given  in  eighths  of  inches. 

*  We  ha\e  now  systematised  the  working  of  these  long  tables,  involving  10,000  to  60,000  entries 
providing  appropriate  checks  for  accuracy  up  to  each  stage  of  construction.  It  seems  unnecessary  to 
describe  these  here,  but  we  shall  be  glad  to  put  oiu-  experience  at  the  service  of  any  one  working  at 
similar  problems.  At  the  same  time  the  collection,  measurement  and  formation  of  a  table  for  the  cross 
homotypic  correlation  of  two  characters  in  25  undifferentiated  like  organs  of  100  individuals  will  cost  a 
siiujle  worker  at  least  three  weeks  to  a  month's  fairly  continuous  labour. 
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Absolute  Dimensions  of  Wild  Ivy  Leaves. 


Character. 

Mean. 

S.  D. 

CootRcient 
of  variation. 

S.  D.  of 
array. 

Percentage 
variation. 

Correlation. 

Length.  . 
Breadth  . 

10-9504  +  -0457 
13-2148  +  -0323 

3-  3885  ±  -0612 

4-  5384  ±  -0433 

30-9442 
34-3430 

2  -  8033 
3-8376 

82-73 
84-56 

-5618  [+  -0092] 
-5332  [±  -0096] 

Cross-  Correlations. 


Length  and  lireadth,  organic    .  . 

•8718  +  -0032 

Length  and  Ijreadth,  homotypic  . 

-5157  [±  -0099] 

These  results  diflPer  very  considerably  from  those  for  the  mushroom  gills.  The 
coefficients  ol  variation  are  even  higher  than  those  for  the  gills,  and  I  think  this  is  a 
result  of  a  certain  amount  of  heterogeneity,  as  well  as  of  the  leaves  on  individual 
runners  not  being  all  quite  at  the  same  stage  of  development.  In  the  case  of  the 
deciduous  trees  the  leaves  were  gathered  in  the  fall  of  the  year,  and  no  further 
development  of  veins  was  possible  ;  but  in  the  ivy  leaves,  taken  indeed  at  the  same 
time,  the  equal  development  of  all  the  leaves  taken  from  the  runner  could  hardly  have 
been  reached  when  they  were  gathered.  On  the  other  hand  the  correlations  are  much 
lower  than  in  the  case  of  the  gills, — much  more  withm  the  range  of  the  results 
obtained  for  other  characters,  I  am  inclined  to  think,  therefore,  that  there  has  been 
some  balancing  of  opposing  factors  here,  heterogeneity  due  to  locus  of  collection  and  to 
stages  of  growth  being  to  some  extent  counteracted  by  a  differentiation  due  to  position 
of  the  leaf  on  the  runner.'^  I  have  not  included  these  results  for  lengths  and  breadths 
of  ivy  leaves  in  my  homotypic  series  because  the  values,  although  well  within  the 
range  of  the  other  determinations,  appear  to  me  to  be  somewhat  fictitiously  so.  The 
disturbing  factors  referred  to  above  seem  to  me  to  have  also  given  the  cross  homotypic 
correlation  a  higher  value  than  it  ought  to  have.  It  will  be  seen  that  the  correlation 
of  length  and  breadth  of  pairs  of  leaves  from  the  same  runner  is  almost  as  high  as  that 
of  the  breadths  of  paii's  of  leaves  from  the  same  runner.  I  should  expect  a  consider- 
ably greater  inequality.  I  regret  that  the  great  labour  of  these  cross-correlation 
investigations  has  hindered  their  being  carried  further  than  these  two  cases  in  the 
vegetable  kingdom,  but  in  the  zoological  data,  which  we  have  at  present  in  hand,  I  hope 
to  get  material  less  open  to  criticism  than  in  the  cases  of  mushroom  gills  and  ivy  leaves. 
These  two  series  were  originally  undertaken  with  the  view  of  ascertaining  how  far 
the  use  of  indices  would  cancel  the  influence  of  the  factor  of  growth.    As  we  have 

*  The  reader  must  of  com^se  bear  in  mind  that  the  leaves  were  taken  from  runners  which  had  not 
reached  the  tops  of  their  walls  or  trees,  i.e.,  they  were  not  from  reproductive  shoots. 
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seen  the  mean  of  the  index-correlations  for  mushroom  gills  and  ivy  leaves  =  '4108,  a 
result  by  no  means  bad,  or  widely  divergent  from  the  mean  result  of  all  our  data. 

If  we  test  the  hypothesis  as  to  the  value  of  the  cross-correlation  we  have  the 
following  results  arranged  as  in  the  case  of  the  mushrooms  : 

Product  of  direct  homotypic  length  correlation  with  organic  length- 
breadth  correlation  =  "4897 

Product  of  direct  homotypic  breadth  correlation  and  organic  length- 
breadth  correlation  =  "4648 

Product  of  mean  of  direct  homotypic  length  and  l^readth  correlations 

with  the  organic  length-breadth  correlations  =  '4773 

These  numbers  have  to  be  compared  with  the  cross  length -l)readth  correlation,  i.e., 
with  -5157. 

We  see  that,  as  in  the  case  of  the  mushroom,  they  are  somewhat  too  small, 
48  say  as  compared  with  52.  But  the  difference  is  considerably  less  here,  and 
allowing  for  the  action  of  disturbing  factors,  I  think  we  may  say  that  the  two 
quantities  under  investigation  are  at  least  of  the  same  order  of  magnitude.  There 
I  think  we  must  leave  the  hypothesis  until  my  zoological  measurements  are 
reduced. 

I  give  the  four  tables  of  classified  data  for  the  absolute  lengths  and  breadths  of 
the  ivy  leaves.  In  the  first  or  organic  correlation  table  the  reader  will  be  able,  in  the 
distribution  at  least  of  breadths,  to  find  something  of  the  irregularity  to  which  I  have 
already  referred. 


VI.  Summary  of  Results. 

(23.)  In  summing  up  my  results  and  comparing  them  with  those  obtained  for 
fraternal  correlation  by  my  co-workers  and  myself,  I  felt  some  difiiculty.  If  I  made  a 
selection  of  what  I  considered  the  best  homotypic  correlation  series,  and  the  best 
fraternal  correlation,  I  might  well  lay  myself  open  to  the  charge  of  selecting  statistics 
with  a  view  to  the  demonstration  of  a  theoretical  law  laid  down  beforehand. 
Accordingly,  I  determined  to  include  all  my  homotypic  results,  excejjt  those  for  the 
absolute  dimensions  of  mushroom  gills  and  ivy  leaves,  where  it  was  pretty  evident 
that  we  had  to  a  greater  or  less  degree  an  influence  exerted  by  the  growth  factor. 
I  thus  drew  up  Table  XXXII.,  containing  a  summary  of  all  my  results.  I  am  quite 
sure  that  heterogeneity  due  to  one  or  another  cause,  exerts  an  influence  of  one  kind 
at  the  top,  and  differentiation  an  influence  of  another  kind  at  the  bottom  of  this 
table.  The  amount  of  these  influences  and  of  other  disturbing  causes,  one  cannot 
measure  and  allow  for.  I  can  only  hope  that  having  taken  a  fairly  wide  range  of 
races  and  characters,  the  influences  tending  to  obscure  the  homotypic  correlation,  on 
the  one  hand  by  raising  it,  and  on  the  other  by  reducing  it,  will  about  balance,  and 
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the  average  of  the  whole  series  be  a  ftiirly  close  apj)roximation  to  tlie  true  mean 
value  of  homotypic  correlation.  The  result  is  '4:570.  Turning  now  to  our  results  for 
fraternal  correlation,  I  put  into  a  second  table  every  single  result  that  we  have 


Table  XXXII. — General  Results  for  Homotypic  Correlation. 


Race. 


Mushroom,  Hampden    .  . 
))  ))         •  ■ 

))  )!  •  ■ 

Wild  Ivy,  mixed  localities 


(i.)  Ceterach,  Somersetshire 
(ii.)  Hartstongue,  Somerset- 
shire 

(iii.)  Shirley  Poppy,  Chelsea  . 
(iv.)  English  Onion,  Hampden 

(v.)  Holly,  Dorsetshire 
(vi.)  Spanish  Chestnut,  mixed 
(vii.)  Beech,  Buckinghamshire 
(viii.)  Papaver  L'lia'iix,  Hampden 
(ix.)  Mushroom,  Hampden 

(x.)  Papaver  liliQ'ux,  Quantocks 
(xi.)  Shirley  Poppy,  Hampden 
(xii.)  Spanish  Chestnut,  Buck- 
inghamshire 
(xiii.)  Broom,  Yorkshire 
(xiv.)  Ash,  jNIoinnouthshire 
(xv.)  Pa/iavcr   PIia>as,  Lower 

Chilterns 
(xvi.)  Ash,  Dorsetshire  .    .  . 
(xvii.)  Ash,  Buckinghamshire  . 
(xviii.)  Holly,  Somersetshire 
(xix.)  Wild   Ivy,  mixed 
localities 

(xx.)  Nujella  HisjKtnica,  Slough 

(xxi.)  Maha  BotundifoUa, 

Hampden 
(xxii.)  A7oodmtf,  Buckingham- 
shire 


Character. 


Lengths  of  gills  . 
Breadths  of  gills  . 
Lengths  and 

hreadths  of  gills 
Lengths  of  leaves  . 
Breadths  of  leaves 
Lengths  and 

breadths  of  leaves 


Mean  of  22  cases 


Lolies  on  fronds 
Sori  on  fronds 


Stigmatic  l)ands 
Veins  in  tunics 


Prickles  on  leaves 
Veins  in  leaves 
Veins  in  leaves . 
Stigmatic  bands 
Gill  indices  .  . 


Stigmatic  Itands 
Stigmatic  bands 
Veins  in  leaves. 


Seeds  in  pods  . 
Leaflets  on  leaves 
Stigmatic  bands 

Leaflets  on  leaves 
Leaflets  on  leaves 
Prickles  on  leaves 
Leaf  indices .  . 


Segments  of  seed- 
capsides 

Segments  of  seed- 
vessels 

Member  s  of  whoi'ls 


Per- 


variation. 


Corre- 
lation. 


G7' 


82' 
84' 


92 
G7 


73 
.56 


77 -.57 

77-  64 

78-  86 

79-  18 

80-  11 
80  -  6.5 
82-17 

82-  71 

83 -  .58 

84-  59 

85-  18 
88-51 

90-  96 

91-  44 
91-66 

91-  81 

92-  73 

93-  12 
96-21 


98-18 
98-32 
98-49 


87-44 


■  8607 
■7363 

■  6275 

■5618 
•5332 
-5157 


6311 
6303 

6149 
6108 

5985 
5913 
5699 
5620 
5490 

5333 
5238 
4655 

4155 
4047 
3997 

3964 
3743 
3548 
2726 


1899 
•1827 
■1733 

•  4570 


Remarks. 


All  these  results  introduce 
a  correlation  due  to  stages 
of  growth  and  accordingly 
are  not  included  in  the 
determination  of  means. 


Said  to  be  largely  aft'ectcd 
by  growth  and  environ- 
ment. 

Much  selected  in  transit. 
Possibly    slightly  .  hetero- 
geneous. 


Heterogeneous. 


Po.ssibly  influenced  l)y  indi- 
^  idual  arowth. 


All  from  one  field. 


From  two  localities  and  pos- 

siljly  slightly  influenced 

by  differentiation. 
Differentiation   of  organs 

due  to  position  on  stem. 
Principally  spread  from  one 

clump  by  stolons. 
Members  really  different  in 

morphological  origin. 


worked  out  up  to  the  date  of  writing  this  memoir.  I  felt  this  was  the  only  safe 
method,  although  I  might  be  justified  in  cutting  out  several  values  from  the  top  and 
from  the  bottom  of  this  table.    I  am  sure  the  value  for  Daphnia  is  much  too  high, 
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and  the  value  for  longevity  in  man  much  below  the  true  fraternal  correlation.  In 
the  former  case,  the  mothers  were  few  in  number,  in  the  latter  the  non-selective 
death-rate  reduces  very  considerably  the  intensity  of  collateral  inheritance.  Both 
longevity  and  temper  are  included  in  this  table  for  the  same  reason  as  Nigella 
Hispmiica  and  woodruff  in  the  first  table.  I  would  not  run  the  risk  of  any  apparent 
selection  to  reduce  either  homotypic  or  fraternal  correlation  to  a  closer  range  of  values. 
The  mean  of  this  second  table  gives  the  value  "4479  for  fraternal  correlation.  Now  I 
do  not  propose  to  lay  great  stress  on  what  at  first  sight  might  look  like  a  most  con- 
clusive equality  between  the  mean  values  of  homotypic  and  fraternal  correlations, — 
within  the  limits  of  the  probable  errors  "4479  and  "4570  are  indeed  equal.  I  am  quite 
aware  that  a  few  further  series  added  to  either  the  homotypic  or  fraternal  results 
might  modify  to  some  extent  this  equality.  But  what  I  would  ask  the  reader  to  do 
is  to  examine  the  two  tables  side  by  side,  to  note  how  the  first  and  last  several 
results  of  both  may  fairly  be  held  to  be  subject  to  quite  definite  modifying  factors, 
and  then  to  consider  whether  there  is  not  very  substantial  evidence  gathered  from  a 
fairly  wide  range  of  characters  in  nearly  as  wide  a  range  of  species  to  show  that  both 
homotypic  and  fraternal  correlation  fluctuate  about  a  mean  value  between  "4  and  "5. 

1  will  not  venture  to  assert  that  either  are  absolutely  constant,  but  I  do  realise 
that  it  is  extremely  difiicult  with  the  complex  system  of  factors  influencing  living 
forms  to  reduce  our  conditions  to  that  theoretically  perfect  state  in  which  we  shall 
measure  solely  the  factor  we  are  investigating.  If  the  intensity  of  homotyposis  were 
exactly  '45,  I  should  be  inclined  to  distrust  any  long  series  of  results,  one  and  all  of 
which  gave  the  answer  '45  exactly.  There  are  so  many  other  disturbing  factors  which 
only  those  who  have  endeavoured  to  collect  series  of  this  kind  will  fully  appreciate. 
In  the  first  place,  the  theoretical  conception  of  undifferentiated  like  organs  is  very 
hard  to  realise  practically  ;  position  of  the  organ  on  the  branch  or  of  the  branch  on  a 
plant,  however  careful  be  the  collector,  may  really  have  introduced  diflerentiation,  and 
so  weakened  the  apparent  homotyposis.  Secondly,  the  environmental  factor  comes 
into  play.  It  is  difiicult  to  obtain  a  hundred  individuals  with  like  environment ;  soil, 
position  with  regard  to  other  growths,  sunlight,  insect  life,  &c.,  may  difler  in  a  manner 
that  the  collector  cannot  appreciate.  Unlike  environment  may  produce  a  fictitious 
likeness  in  the  organs  of  the  same  individual  when  we  pass  from  one  individual  to  a 
second.  The  fact  that  half  our  series  grew  in  one  field,  the  other  half  in  another, 
that  part  came  from  one  side  of  a  road,  part  from  another,  may  introduce  an  unper- 
ceived  heterogeneity  which  increases  the  apparent  homotyposis.  Thirdly,  the 
difficulty  of  ensuring  that  all  individuals  are  of  the  same  age  or  in  the  same  stage 
of  development,  is  very  great.  The  leaves  of  an  old  tree  may  have  a  fictitious 
likeness  when  compared  with  those  of  a  young  tree  ;  we  may  gather  organs  from  one 
individual  when  it  is  in  a  stage  of  development,  which  would  only  be  reached  in 
another  individual  some  days  or  weeks  later.  These  and  other  factors  may  perhaps 
be  to  some  extent  eliminated — far  better  of  course  by  the  trained  botanist  than  by 
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the  mere  mathematician— but  I  very  much  doubt  the  possibihty  of  their  complete 
elimination.  If  homotyposis  had  a  practically  constant  value  throughout  nature,  I 
should  only  expect  this  value  to  be  ascertained  as  a  result  of  the  average  of  many 
series  in  which  the  opposing  factors  of  differentiation,  environment,  age,  stage  of 
growth,  &c.,  may  more  or  less  counteract  each  other.  In  this  manner  we  may 
approach  to  a  fair  appreciation  of  the  bathmic  influence  of  individuality  in  the  pro- 
duction of  undifferentiated  like  organs.  What  I  should  accordingly  deduce  as 
legitimate  from  the  above  general  results  would  be  this,  the  intensity  of  pure  homo- 
typosis throughout  the  vegetable  kingdom  probably  lies  between  '4  and  '5  ;  this  is  also 
the  mean  value  found  up  to  the  present  for  fraternal  correlation.  We  may  accordingly 
conclude  that  heredity  is  really  only  a  phase  of  the  wider  factor  of  homotyposis.  At 
bottom  it  is  only  part  of  the  principle  that  when  an  individual  puts  forth  undiffer- 
entiated like  organs  these  are  not  exactly  the  same,  but  with  a  definite  intensity  of 
variation  have  a  definite  degree  of  likeness.  When  we  associate  heredity  with  sexual 
reproduction,  we  are  only  considering  the  result  of  homotyposis  (variation  and  like- 
ness) between  individual  spermatozoa  and  between  individual  ova.  Such  homotyposis 
leads  to  a  likeness  of  the  individuals  resulting  from  the  zygotes,  which  has  probably 
the  same  mean  value  as  homotypic  correlation  itself  Thus  if  the  continuity  of  the 
germ  cells  between  parent  and  offspring  be  realised,  we  face  no  longer  the  problem  of 
heredity,  but  that  of  homotyposis,  and  this  again  might  possibly  be  reduced  to  the 
simplest  problem  of  budding  or  cell  multiplication.  Why  does  the  fundamental  life- 
unit  on  self-multiplication  produce  homotypes  with  a  definite  degree  of  likeness  and  a 
definite  degree  of  variation  ?  I  shall  hope  for  further  light  on  this  problem  when 
my  data  for  homotyposis  in  the  animal  kingdom,  already  being  collected,  are  some- 
what more  complete ;  but  only  the  biologist,  not  the  mathematician,  can  solve  it. 

(24.)  Now  let  us  turn  to  another  point :  homotyposis  involves,  as  we  have  seen, 
not  only  a  certain  degree  of  likeness  in  the  group  of  homotypes  but  a  certain  degree 
of  variation.  Our  series  does  not  include  any  groups  of  more  than  twenty-six  homo- 
types,  except  in  the  case  of  the  Hampden  Shirley  poppies.  Hence  it  is  not  really 
possible  to  calculate  directly  the  variability  of  the  individual.  But  from  theoretical 
considerations,  as  well  as  from  the  support  of  individual  instances,  we  have  seen  that 
the  standard  deviation  of  the  array  is  a  reasonable  measure  of  the  variability  of  the 
individual.*  Of  course  more  elaborate  direct  investigations  on  this  point  would  be 
of  great  interest.  But  I  consider  that  the  present  series  indicate  that  on  an  average 
the  variation  in  the  individual  is  some  87  to  88  per  cent,  of  that  of  the  race.  If  the 
reader  will  examine  the  column  headed  Percentage  Variation  in  Table  XXXII.,  he  will 
notice  that  excluding  the  lengths  and  breadths  of  mushroom  gills — cases  in  which 
the  stage  of  growth  is  all  important — no  percentage  variation  falls  below  77.  Now 

*  It  is  quite  easy  to  find  isolated  individuals  with  a  greater  degree  of  variability  than  this,  and  even  a 
greater  variability  than  that  of  the  race,  it  is  the  avercu/e  individual  variability  which  is  represented  by  the 
S.  D.  of  the  array. 
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this  seems  to  me  veiy  instructive  when  we  consider  tlie  statements  made  by  some 
writers  who  theorise  about  variation  rather  than  actually  measure  it  :  The  undifferen- 
tiated like  organs  put  forth  by  the  individual  have  in  round  numbers  80  to  90  jjer 
cent,  of  the  variation  oj  such  organs  in  the  race.  Does  not  this  completely  refute  the 
views  frequently  expressed  that  variation  is  the  result  of  sexual  reproduction,  and  that 
it  is  quite  insignificant  in  the  case  of  budding  ?  Sexual  reproduction  may  produce  a 
type  which  is  not  that  of  either  parent,  but  this  does  not  d  fortiori  alter  the  vari- 
ability of  the  race  with  regard  to  any  organ.  With  continu(jus  variation  such  type 
would  have  previously  existed  as  far  as  any  special  organ  or  character  is  concerned, 
and  its  repetition  contributes  nothing  to  the  racial  variability.  We  have  to  meet  the 
fact  that  the  individual  produces  undifferentiated  like  organs  with  a  remarkable 
degree  of  variety,  and  if  the  investigations  of  the  present  memoir  be  valid  the 
practical  result  of  the  homotyposis  factor  having  a  value  of  "4  to  "5  would  denote  that 
the  individual  variability  is  91 '65  to  86'60  per  cent,  of  the  racial. 

(25.)  Lastly,  we  may  consider  a  third  point  of  very  great  interest,  which  is 
involved  in  Table  XXXIV.  Here  we  have  the  coefficients  of  variation  given  for  some 
twenty-two  series.  Now  this  coefficient  seems  to  me  the  only  satisfactory  comparative 
measure  we  can  find  at  present  of  variability.  A  variation  of  two  in  the  petals  of 
a  buttercup  is  far  more  significant  than  one  of  two  in  the  fiorets  of  an  ox-eyed  daisy  ; 
the  measures  of  the  absolute  variations  as  given  by  the  standard  deviations  seem  to 
me  of  no  use  when  we  are  comparing  different  characters  in  different  species.  In 
default  of  the  suggestion  of  any  better  standard,  all  we  can  do  is  to  get  rid  of  absolute 
size  or  number  by  using  the  percentage  variation  of  the  character  as  indicated  in  the 
coefficients  of  variation.  Now  our  table  gives  a  fairly  continuous  series  from  7  "80  uj^ 
to  41*96.*  The  mean  value  of  19  is  in  very  good  accord  with  the  results  I  have 
obtained  for  variation  in  a  much  wider  series  for  the  vegetable  kingdom.  Now  I 
think  it  will  be  admitted  : 

(«.)  that  this  variation  is  based  on  results  for  a  wide  number  of  species ; 
(b.)  that  this  variation  covers  a  considerable  variety  of  characters  ; 
(f.)  that  it  is  roughly  continuous  in  value  between  8  and  40. 

But  if  we  examine  the  fourth  column  in  this  table,  which  gives  the  place  of  the 
corresponding  homoty^jic  correlation,  we  find  absolutely  no  relationship  between  the 
intensity  of  the  homotyposis  and  of  the  variability.  The  homotypic  order  will  not 
compare  in  any  way  with  the  variation  order.  The  mean  variability  of  the  first  eleven 
series  is  22*95,  and  of  the  last  eleven  series  14*28,  very  sensible  deviations  from  the 
mean  18*62  of  the  whole  twenty-two  series.  But  the  mean  homotypic  correlation  of 
the  first  eleven  series  is  -4559,  and  of  the  last  eleven  series  '4581,  neither  of  which 

*  If  we  excluded  the  hai  tstongue  as  largely  influenced  by  environment,  we  should  find  broom  heading 
the  list,  one  of  my  most  satisfactory  series,  both  as  to  similarity  of  individual  environment  and  as  to 
smoothness  of  frequency.    We  should  still  conclude  that  variation  might  reach  to  nearly  40. 
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exhibits  any  sensible  difference  from  "4570  the  mean  of  all  twenty-two  series.  We  are 
compelled  therefore  to  conclude  that  there  is  no  relationship  between  the  variability  of 
a  species  with  regard  to  any  organ,  and  the  homotypic  correlation  of  such  organs.  It 
would  be  quite  impossible  to  assert  that  as  the  variabiHty  of  a  species  decreased,  its 
homotyposis  increased.  But  according  to  the  results  reached  in  this  paper  heredity 
is  a  result  of  homotyposis.    Hence  there  seems,  so  far  as  our  researches  go,  no  ground 


Table  XXXIV. — General  Results  for  Variability. 


Coefficient 

Position  in 

Race. 

Character. 

of 

correlation 

variation. 

table. 

Hartstongue,  Somersetshire  .... 

Sori  on  fronds  

41-96 

(ii.) 

Broom,  Yorkshire  

Seeds  in  pods   

36-78 

(xni.) 

Holly,  Dorsetshire  

Prickles  on  leaves.    .    .  . 

26-29 

(V.) 

Nigella  Hispivnica,  Slough  

Segments  in  capsules.    .  . 

19-39 

(XX.) 

Holly,  Somersetshire  

Prickles  on  leaves .... 

18-74 

(xviii.) 

Ash,  Dorsetshire  

Leaflets  on  leaf  

18-65 

(xvi.) 

Ash,  Monmouthshire  

))  ))  

18-57 

(xiv.) 

Mushroom,  Buckinghamshire  .... 

Uill-indices  

18-28 

(ix.) 

Ceterach,  Somersetshire  

Lobes  on  fronds  .... 

18-25 

(i-) 

Papaver  Ehmas,  top  of  Chilterns  .    .  . 

Stigmatic  bands  .... 

17-81 

(vm.) 

Wild  Ivy,  mixed  

Leaf -indices  

17-77 

(xix.) 

Papaver  Iihreas,  Quantocks  

Stigmatic  bands  ..... 

17-66 

(X.) 

English  Onion,  Hampden  

Veins  in  tunics  

17-43 

(iv.) 

Veins  in  leaves  

1  ^  •  T  0 
iO    <  Ji 

(VI.) 

Ash,  Buckinghamshire  

))  ))  

15-46 

(xvii.) 

Papaver  Rhmas,  Lower  Chilteins  . 

Stigmatic  bands  .... 

15-27 

(XV.) 

Shirley  Poppy  Hampden 

))          ))  .... 

15-17 

(xi.) 

Spanish  Chestnut,  Buckinghamshire. 

Veins  in  leaves  

14-31 

(xii.) 

Shirley  Poppy,  Chelsea  

Stigmatic  bands  .... 

13-99 

(iii.) 

Woodrufi',  Buckinghamshire  .... 

Members  in  whorls    ,  . 

13-46 

(xxii.) 

Beech,  Buckinghamshire  

Veins  in  leaves  

10-77 

(vii.) 

Malva  Piotundifolia,  Hampden  .... 

Segments  in  seed-vessels 

7-80 

(xxi.) 

Mushroom,  Buckinghamshire  .... 

Lengths  of  gills  .... 

21-00 

))                ))  .... 

Breadth  of  gills  .... 

23-42 

Wild  Ivy,  mixed  .  ,  

Lengths  of  leaves  .... 

30-94 

Breadths  of  leaves    .    .  . 

34-34 

Mean  of  first  22  cases    .  . 

18-62 

Mean  of  all  26  cases  .    .  . 

19-97 

for  asserting  that  increased  intensity  of  heredity  means  decreased  intensity  of  varia- 
tion and  vice  versd.  The  general  impression  formed  upon  my  mind  by  our  measure- 
ments is  the  approximate  constancy  of  the  pure  homotypic  factor,  or  at  any  rate  its 
variation  within  a  fairly  small  range,  and  consequently  the  approximate  constancy 
within  a  like  small  range  of  the  hereditary  factor  for  all  species  and  all  characters. 
VOL.  cxcvii. — A.  3  A 
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On  the  other  hand,  great  diversity  in  the  variabiHty  of  different  species.  I  do  not 
wish  to  be  dogmatic  on  this  point,  however,  but  I  would  definitely  assert  that  so  far 
no  relationship  between  variability  and  homotyposis  is  discoverable.  Further,  it  does 
not  seem  to  me  that  looking  down  the  order  of  variability  we  can  assert  that  our  table 
in  any  way  corresponds  to  the  order  of  evolutionary  descent.  It  does  not  seem  to 
me  that  the  species  towards  the  top  of  it  are  the  more  primitive  and  those  towards 
the  bottom  the  more  complex.  Indeed  two  series  of  the  same  species  may  stand  at 
very  different  points  in  the  table.  I  am  accordingly  forced  to  the  conclusion  that  my 
table  gives  no  support  whatever  to  the  view  that  variability  in  the  vegetable  kingdom 
is  greatest  for  the  more  primitive  and  simpler  organisms.  Nor,  again  examining 
Table  XXXII.,  can  we  conclude  that  homotyposis  is  least  intense  for  such  organisms, 
for  the  second  half  of  our  table  cannot  be  said  to  contain  more  of  such  organisms  than 
the  first.  Shortly,  I  see  no  relation  between  the  position  of  an  organism  in  the 
evolutionary  descent  and  the  intensity  of  either  its  variability  or  its  homotyposis. 
Hence  it  follows  that,  if  homotyposis  be  the  source  of  heredity,  the  intensity  of  the 
latter  cannot  be  related  to  either  variability  or  position  in  evolutionary  descent. 

Mr.  Adam  Sedgwick  may  be  right  in  his  view*  that  in  the  early  stages  of  evolution 
variability  is  large  and  inheritance  small,  and  that  with  development  this  relation  is 
changed,  variability  decreasing  with  an  increasing  intensity  of  heredity.  My 
statistics  only  cover  a  small  portion  of  one  kingdom.  But  so  far  as  I  am  aware 
they  represent  the  only  material  so  far  published  on  which  a  definite  judgment  can  be 
founded — where  by  definite  judgment,  I  mean  one  based  on  quantitative  facts,  and 
not  drawn  from  a  general  estimate  of  nature  which  makes  no  statement  whatever 
of  the  intensity  of  either  variability  or  heredity  for  any  single  character  in  any  one 
species  of  living  things. 

The  data  I  have  given  seem  entirely  opposed  to  Mr.  Sedgwick's  view,  but  I  very 
fully  realise  that  they  are  far  from  exhaustive.  They  indicate,  however,  what 
I  hold  to  be  the  only  valid  method  of  approaching  any  problem  in  variation  or 
heredity,  i.e.,  laborious  statistical  collection  of  actual  facts.  My  statistics  may  be 
insufficient,  I  would  .heartily  welcome  additions  to  them ;  or  they  may  be  dealt  with 
fallaciously.  Here  is  the  material  ;  let  others  add  to  and,  if  need  be,  modify  the 
conclusions.  Only  let  us  follow  the  method  so  clearly  indicated  by  Darwin  himself 
in  his  '  Cross  and  Self-fertilisation  of  Plants  ; '  let  us  cease  to  propound  hypotheses 
illustrating  them  by  isolated  facts  or  vague  generalities  ;  there  are  innumerable 
species  at  hand  in  Nature  ready  for  us  to  measure  and  count.  Sine  niimero  nihil 
demonstrandum  est,  should  now-a-days  be  the  motto  of  every  naturalist  who  desires 
to  propound  novel  hypotheses  with  regard  to  variation  or  heredity.  It  is  equally  valid 
for  those  who  merely  seek  either  to  establish  more  firmly  or  to  modify  the  old. 

(26.)  I  would  sum  up  the  results  of  this  memoir  in  the  following  conclusions  : 

*  Presidential  Address  to  Section  D  of  the  British  Association,  1899. 
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(i.)  When  an  individual  of  any  species  produces  a  number  of  undifferentiated  like 
organs  (homotypes)  these  are  not  exactly  alike  when  measured  with  regard  to  any 
character.  They  exhibit  a  certain  degree  of  variation  combined  with  a  certain  degree 
of  likeness  (homotypic  correlation). 

(ii.)  The  homotypic  correlation  as  tested  for  twenty-two  series  of  homotypes  in  a 
variety  of  species  in  the  vegetable  kingdom,  has  a  mean  value  between  '4  and  "5.  Its 
exact  determination  is  rendered  difficult  by  a  number  of  conflicting  factors ;  but  it 
seems  very  possible  that  the  actual  value  of  pure  homotypic  correlation,  i.e.,  the 
correlation  due  to  the  individuahty  of  the  plant  as  apart  from  the  influence  of 
environment,  &c.,  differs  but  little  from  the  mean  value  stated  above. 

(iii.)  If  a  character  occurs  in  a  series  of  homotypes,  it  will  be  found  to  have  in  the 
individual  a  variability  80  to  90  per  cent,  of  the  variability  of  this  character  in  the 
species  at  large.  It  is  thus  seen  that  variability  is  a  primary  factor  of  living  forms, 
and  is  not  in  any  way  dependent  on  sexual  reproduction. 

(iv.)  With  a  certain  hypothesis  as 'to  cross  homotypic  correlation,  i.e.,  the  correlation 
of  two  different  characters  in  homotypes,  namely,  that  it  is  the  product  of  the  direct 
homotypic  and  of  the  organic  correlations,  it  would  follow  that  the  mean  fraternal 
correlation  would  equal  the  mean  homotypic  correlation.  The  mean  of  twenty-two 
homotypic  series  is  found  to  be  sensibly  identical  with  the  mean  of  nineteen  fraternal 
series.  A  direct  investigation  of  the  value  of  the  cross  homotypic  correlation  is  only 
made  for  two  cases,  that  of  the  mushroom  gill  and  that  of  the  ivy  leaf  The  cross 
homotypic  correlation  is  not  found  to  differ  very  widely  from  the  product  of  the 
direct  homotypic  and  the  organic  correlations  in  these  cases,  but  the  results  are  not 
close  enough  to  be  conclusive. 

(v.)  It  would  seem,  if  (iv.)  be  true,  that  heredity  is  only  a  phase  of  the  principle 
of  homotyposis  (ii.),  and  that  the  numerical  value  of  its  constants  may  be  found  from 
that  principle. 

(vi.)  No  relation  whatever  could  be  found  between  the  intensity  of  variability  and 
that  of  homotyposis  in  the  twenty-two  series  dealt  with.  Nor  was  any  relation  to 
be  observed  between  the  relative  simplicity  of  the  organism  and  the  intensity  either 
of  its  variability  or  its  homotypic  correlation.  Kegarding  heredity  as  a  case  of 
homotyposis,  there  seems  no  reason  to  suppose,  as  it  has  been  suggested,  that 
variability  has  decreased  and  heredity  increased  in  the  course  of  evolution.  On  the 
contrary  there  seems  some  ground  for  supposing  that  homotyposis  (and  therefore 
heredity)  is  a  primary  factor  of  living  forms,  a  condition  for  the  evolution  of  life  by 
natural  selection,  and  not  a  product  of  such  selection.  If  the  mushroom,  the  poppy, 
and  the  beech  show  approximately  equal  homotyposis,  it  seems  well  nigh  impossiblo 
to  consider  it  as  a  factor  of  life,  increasing  with  advancing  evolution. 
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Appendix. 

On  the  Homotyposis  of  the  Pods  of  Leguminous  Plants. 

I  have  already  indicated  that  I  had  initially  grave  doubts  as  to  whether  a  true 
measurement  of  homotyposis  could  be  obtained  by  counting  the  ripe  seeds  in  the 
pods  of  leguminous  plants ;  but  that  a  passage  in  a  work  of  Darwin's  led  me  to 
modify  my  opinion,  and  when  finishing  my  memoir,  I  found  time  to  incorporate  in  it 
the  broom  series  from  Danby  Dale  which  seemed  to  confirm  Darwin's  views.  Broom 
is  cross-fertilised  and  much  visited  by  insects,  and  accordingly  the  statement  made 
on  p.  334  of  this  memoir  appears  to  have  full  validity.  As  the  matter  seemed  of  some 
interest,  I  suggested  to  Miss  C.  D.  Fawcett  a  fuller  study  of  leguminous  plants 
from  this  standpoint.  Her  results  have  reached  me  in  time  to  be  reduced  by  Dr.  Lee 
and  to  be  incorporated  in  this  Appendix.  I  have  myself  worked  series  for  the  common 
bean  and  the  tare  vetch,  so  that  if  these  be  included  we  have  now  a  fairly  extensive 
range  of  material  from  which  to  draw  conclusions. 

I  will  first  give  the  data  and  state  the  manner  in  which  it  has  been  reduced,  and 
then  consider  the  results  which  flow  from  its  consideration.  The  following  is  a  list 
of  the  species  dealt  with  ;  I  have  remarked  on  some  of  the  difficulties  arising  in 
the  course  of  our  work. 

(i.)  Cytisus  Scoparius.  Broom  from  Danby  Dale  (see  p.  334). — It  was  found  difficult 
to  count  the  total  number  of  ovules,  and  only  the  ripened  seeds  were  dealt  with. 
The  plant  is  cross-fertilised.    The  pods  were  fairly  free  from  insects. 

(ii.)  Lotus  Corniculatus.  Bird's-foot  trefoil  from  the  neighbourhood  of  Lyme 
Regis,  Dorsetshire. — The  aborted  seeds  could  not  be  counted,  and  the  pods  were 
largely  frequented  by  grubs,  so  that  it  was  occasionally  doubtful  how  far  ripe  seeds 
had  been  destroyed.  The  number  of  pods  on  each  plant  in  this  first  series  were  too 
few  too  allow  those  with  grubs  in  them  to  be  neglected.* 

(iii.)  Lotus  Corniculatus. — A  second  series  from  the  neighbourhood  of  Lyme  Regis. 
This  series  consisted  of  more  vigorous  plants  from  a  slightly  different  environment,  so 
that  fourteen  to  twenty  pods  could  be  gathered  from  each,  and  thus  difficulties  as  to 
the  possible  destruction  of  ripe  seeds  by  grubs  avoided.  It  would  appear  that  the 
effect  of  rejecting  such  pods  has  raised  the  mean  number  of  seeds,  but  at  the  same 
time  artificially  reduced  the  homotyposis.  The  plant  is  cross- fertilised.  (Darwin  : 
'  Cross  and  Self-Fertilisation  of  Plants,'  p.  361.) 

(iv.)  Lathy rv,s  Odoratus.  Sweet  Pea. — This  was  obtained  from  a  row  in  a  nursery 
garden  at  Lyme  Regis.  Both  the  fertilised  and  aborted  seeds  were  counted,  so  that 
the  total  number  of  ovules  could  be  found.    The  plant  is  said  to  be  cross-fertilised  in 

*  As  an  illustration  of  this  sort  of  difficulty  I  may  note  that  I  found  something  like  90  per  cent,  of  the 
pods  of  gorse  in  the  Danby  district  so  full  of  insect  life  that  nothing  could  be  done  in  regard  to  counting 
their  seeds. 
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countries  where  it  is  indigenous,  but  Daewin  ('  Cross  and  Self-Fertilisation  of 
Plants,'  3rd  edition,  p.  155,  et  seq.)  strongly  believes  it  to  be  wholly  self-fertihsed  in 
England, 

(v.)  Lathyrus  Odoratus. — A  shorter  series  obtained  from  a  second  nursery  garden 
at  Lyme  Regis. 

(vi.)  Lathyrus  Sylvestris.  Everlasting  pea  from  the  sea  coast  not  very  far  from 
Lyme  Regis.  —Both  the  ripe  and  the  aborted  seeds  were  counted.  The  plant  appears 
to  be  cross-fertilised.  Professor  F.  O.  Oliver  tells  me  that  he  has  watched  bees 
effectively  visiting  the  cultivated  everlasting  pea. 

(vii.)  Vicia  Faha.  Common  Bean. — I  took  100  plants,  each  having  at  least  ten 
pods,  from  a  field  in  Danby  Dale,  in  which  a  mixture  of  oats,  the  common  pea,  the 
common  bean,  and  three  or  four  vetches*  (described  by  the  owner  as  tares),  was 
growing  as  food  for  cattle.  The  perfect  beans  only  were  counted,  but  the  plants 
were  a  poor  crop,  many  pods  being  stunted  in  their  growth,  and  it  was  not  always 
easy  to  determine  whether  the  seeds  had  not  been  fertilised  or  there  had  been  failure 
owing  to  want  of  nutrition.  The  common  bean  is  both  cross  and  self-fertilised. 
I  have  to  thank  Miss  J.  Sharpe  and  Miss  E.  Cyriax  for  aid  in  the  work  on  these 
beans. 

(viii.)  Vicia  Hirsuta. — I  found  twenty-eight  plants  of  the  tare  vetch  growing  on  a 
strip  of  uncut  grass  along  a  cut  cornfield  at  Botton,  Danby  Dale.  I  could  find  no 
more  plants  in  the  neighbourhood.  There  was  plenty  of  the  common  vetch  [Vicia 
Sativa)  on  the  same  strip,  but  the  plants  had  rarely  more  than  two  to  three  pods  on 
them.  I  was  not  able  to  get  ten  pods  from  each  tare  vetch,  thirteen  plants  had  fewer, 
and  seven  plants  more.  The  series  being  short,  the  probable  errors  are  high,  but  it 
seemed  worth  while  to  include  the  data.  I  endeavoured  to  count  both  ripe  and 
aborted  seeds,  but  here,  as  in  one  or  two  of  the  previous  cases,  I  much  doubt  whether 
we  have  succeeded  in  counting  all  the  ovules.  In  some  of  the  green  pods  the  number 
of  ovules  seemed  to  be  considerably  larger  than  in  the  fully  ripe  pods,  and  I  think  it 
possible  that  the  non-fertilised  seeds  shrink  till  they  are  quite  unnoticeable  even  by 
a  cautious  observer.  Again,  it  is  by  no  means  certain  here,  as  in  other  cases,  that  all 
the  seeds  reckoned  as  aborted  are  really  non-fertilised.  It  is  quite  possible  that  in 
some  cases  fertilised  seeds  have  dwindled  for  want  of  nutriment  till  they  appear 
aborted. 

Vicia  Hirsuta  is  either  cross  or  self-fertilised  (Darwin  :  '  Cross  and  Self-Fertilisa- 
tion of  Plants,'  p.  367).  Whether  this  vetch,  or  indeed  the  common  bean,  were 
in  our  case  wholly  cross-fertilised  or  in  part  self-fertilised,  I  do  not  see  that  we  have 
any  means  of  settling. 

Table  L.  gives  the  frequency  distributions  of  the  various  series.  Now  in  these  series 
we  must  bear  in  mind  that  we  are  only  definitely  certain  of  one  thing,  the  number 

*  Neither  the  pea  nor  the  vetches  offered  enough  pods  per  plant  for  an  investigation  of  their  homotyposis 
to  be  of  value. 
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of  fully  ripened  seeds — and  this  only  when  the  pods  were  not  infested  with  grubs.  We 
did  the  best  we  could  to  count  the  aborted  seeds,  but  we  cannot  be  certain  that  all 
those  counted  as  aborted  were  non-fertilised,  or  that  we  succeeded  in  counting  all 
those  which  had  shrunk  to  microscopic  proportions.  The  number  of  ovules  is  simply 
the  sum  of  the  ripe  and  the  aborted  seeds  counted  in  each  pod,  and  this  again  must  be 
somewhat  doubtful.  The  reader  will  bear  in  mind  that  we  proceeded  with  care,  but 
that  we  think  it  right,  if  anything,  to  rather  over-  than  under-emphasise  possible 
sources  of  error.  Yet  allowing  for  such  sources  of  error  we  cannot,  on  examining  the 
results  given  in  Table  LI.,  allow  that  Broom  is  in  the  least  representative  of  the 
degree  of  homotyposis  to  be  found  in  the  pods  of  leguminous  plants.  Whether  we 
consider  the  ovules,  the  ripe,  or  the  aborted  seeds,  our  results  are  sensibly  beloM^  that 
for  broom,  and  the  mean  of  the  whole  sixteen  series  gives  us  a  value  about  one-half 
that  of  the  homotypic  correlation  based  upon  characters  not  depending  on  fertilisation. 
If  we  deal  with  averages,  it  would  certainly  seem  that  in  the  results  flowing  from 
fertilisation,  we  have  reduced  the  intensity  of  the  individuality  to  about  half  its 
previous  value. 

To  this  extent  only  does  the  individual  constitution  appear  influential  in  the 
number  of  seeds  in  the  pod,  the  remainder  of  the  homotypic  intensity  seems  to  have 
disappeared  under  random  influences  having  nothing  to  do  with  the  individuality  of 
the  plant.  This  is  perhaps  what  we  might  expect  in  the  case  of  ripe  seeds  in  cross- 
fertilised  plants,  where  the  fertilisation  may  depend  on  the  chance  or  not  of  insect 
visitation  and  the  effectiveness  or  not  of  the  pollen  brought  on  such  occasions.  We 
should  have  to  assert  that  the  bird's-foot  trefoil  and  the  everlasting  pea,  whose 
average  for  homotypic  correlation  in  the  case  of  ripe  seeds  is  about  "21  to  "22,  lose 
half  the  intensity  of  their  individuality  through  the  random  nature  of  the  chances  of 
the  cross-fertilisation.  But  although  this  might  be  fairly  satisfactory  for  these  cases, 
what  are  we  to  say  for  the  species  which  are  self-fertilised  absolutely  or  self-fertilised 
in  default  of  cross-fertilisation  ?  We  might  have  expected  a  high  degree  of  homotyposis 
in  the  field  bean  or  the  tare  vetch,  where  failing  cross-fertilisation  we  are  told  there 
will  be  self-fertilisation.  We  find  on  the  contrary,  however,  in  these  results  some  of 
the  lowest  homotypic  correlations  of  the  whole  series.  The  sweet  pea  also  in  its  two 
series  presents  some  very  remarkable  results.  If  the  sweet  pea  be  entirely  self- 
fertilised  then  we  should  expect  the  homotyposis  of  both  ovules  and  ripe  seeds  to  be 
fully  up  to  the  average.  In  neither  case  is  such  a  result  reached,  although  in  one 
series  we  have  a  value  for  the  ovules  higher  than  that  obtained  for  anything 
except  the  broom.  The  noteworthy  fact,  however,  is  that  the  results  for  the  two 
series  of  sweet  peas  differ  so  widely  in  character  !  In  the  first  series  it  is  the 
abortion  which  is  most  individual  in  character ;  in  the  second  series  it  is  the  ovules. 
In  the  first  series  the  ripe  seeds  have  far  less  individuality  than  the  aborted  seeds, 
in  fact,  the  smallest  homotyposis  I  have  yet  observed  ;  in  the  second  series  the  ripe 
seeds  have  a  value  rising  to  '2,  which  is  higher  than  that  of  the  aborted  seeds.  Ash 
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trees  from  Monmouthshire,  Buckinghamshire,  and  Dorsetshire,  gave  results  in  good 
agreement ;  but  sweet  peas  from  two  different  nursery  gardens  in  the  same  district 
give  strikingly  divergent  homotypic  correlations  ! 

According  to  Darwin  ('  The  Effects  of  Cross  and  Self-Fertilisation,'  3rd  edition, 
p.  153)  the  sweet  pea  "  in  this  country  seems  invariably  to  fertilise  itself."  He  bases 
this  statement  on  (a)  the  difficulty  of  access  to  bees  and  other  insects,  (h)  the  fact  that 
the  varieties  are  habitually  grown  by  seed  growers  close  together,  and  yet  the  colours 
do  not  blend,  (c)  the  experience  that  when  the  varieties  are  artificially  crossed  the 
colours  do  change.  Professor  Delphino,  of  Florence,  in  a  letter  to  Darwin,  writes 
that  it  is  the  fixed  opinion  of  the  Italian  gardeners  that  the  varieties  do  cross  ;  but 
in  Italy  other  insects  may  of  course  be  available.  Now  it  is  most  remarkable  that  in 
Series  I.,  whether  the  sweet  pea  be  crossed  or  self-fertilised,  the  number  of  ripe  seeds 
should  be  hardly  individual  at  all.  It  would  indicate  that,  with  the  particular 
environment  of  this  series,  the  chance  even  of  self-fertilisation  depends  upon 
extraneous  causes.  The  comparatively  high  individuality  in  the  tendency  to  abort 
may  mark  some  peculiarity  of  this  variety ;  there  may  be  a  distinct  inheritance  of 
sterile  tendencies  to  be  kept  separate  from  an  inheritance  of  fertility.  But  it  is 
probably  idle  to  guess  at  explanations  of  such  discrepancies.  I  have  gone  carefully 
into  the  differences  of  environment  in  the  two  series  which  are  interesting,  although 
they  do  not  obviously  provide  any  key  to  the  mystery.  The  first  series  were  thickly 
planted  in  a  long  row,  and  the  seed  pods  were  gathered  unripe.  It  was  more 
difficult  to  separate  each  plant  and  to  be  quite  certain  that  some  of  the  seeds  would 
not  have  had  to  be  reckoned  as  aborted  in  the  fully  ripened  pods.  The  second  series 
consisted  of  withered  plants,  the  pods  being  almost  all  completely  ripe ;  the  peas 
were  planted  in  separate  groups  a  few  feet  apart.  There  was  no  difficulty  in 
ascertaining  the  individuality  of  the  plant  nor  as  to  the  number  of  ripe  or  aborted 
seeds.  About  forty  hives  of  bees  were  kept  in  the  immediate  neighbourhood  of  this 
second  series  ;*  the  first  series  were  at  the  other  end  of  Lyme  Begis,  and  removed 
from  hives.  Both  series  had  had  their  flowers  freely  cut,  I  might  have  been 
prepared  to  attribute  the  low  value  of  the  homotyposis  in  the  sweet  pea  to  this 
latter  cause,  but  then  the  common  bean  and  the  tare  vetch  had  not  been  subjected 
to  any  similar  process,  and  they  give  on  the  whole  much  lower  values.  The  ovules 
in  either  self-fertilised  or  cross-fertilised  plants  ought  indeed  to  give  a  fair  measure 
of  homotyposis,  but,  for  reasons  already  stated,  it  is  not  certain  that  our  procedure 
in  counting  the  perfect  and  the  aborted  seeds  in  the  ripened  pods  has  led  to  a  just 
estimate  of  them.  I  am  inclined  on  the  whole  to  attempt  no  explanations  for  the 
anomalies  observed  in  these  seed  investigations.    The  influences  of  self-  and  cross- 

*  I  mention  this,  as  I  noticed  in  Yorkshire  fairly  frequent  but  apparently  ineffectual  visits  of  the  hive 
bees  to  sweet  peas.  It  is  possible  that  their  attempts  may  assist  the  self-fertilisation,  just  as  shaking  or 
wind  has  been  observed  to  do  in  other  species. 
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fertilisation  on  homotyposis  deserve  still  fuller  and  more  direct  investigation.*  I 
feel  we  know  little  as  to  the  influence  of  external  causes  even  on  the  completeness  or 
incompleteness  of  self- fertilisation.  So  far  as  homotyposis  in  the  pods  of  leguminous 
plants  is  concerned,  I  would  draw  conclusions  based  solely  on  averages,  and  state 
that  : — 

(i.)  The  homotyposis  in  the  case  of  either  ripe  or  aborted  seeds  in  the  pods  ot 
plants  seems  weakened  to  one-half  the  average  value  it  has  in  the  case  of  characters 
not  depending  on  fertilisation.  Thus  Darwin's  view,  that  differences  in  the  number 
of  ripe  seeds  depend  upon  the  constitution  of  the  plants,  seems  to  be  only  partially 
true.  Extraneous  causes  about  which  we  are  not  very  clear  appear  to  be  generally 
influential. 

(ii.)  The  extraneous  causes  which  act  in  a  random  manner  on  the  homotypic 
correlation  seem  to  affect  both  self-fertilised  and  cross-fertilised  plants,  and  this  both 
with  regard  to  ripe  and  to  aborted  seeds. 

(iii.)  The  order  of  intensity  of  homotypic  correlation  is  ovules,  ripened  seeds,  and 
aborted  seeds. 

The  reduction  of  homotypic  correlation  in  the  case  of  the  fertility  of  pods  may  be 
profitably  compared  with  the  like  reduction  which  we  find  in  the  case  of  the 
coefficients  of  inheritance  of  fertility  and  fecundity  in  man  and  the  horse,  t 

Other  points  worth  noting,  I  think,  in  our  results  are  those  of  Table  LIL,  which 
gives  the  correlation  between  ovules  and  rijje  and  aborted  seeds.  As  we  might 
anticipate,  the  more  ripe  the  fewer  aborted  seeds,  and  vice  versd.  The  correlation  is 
negative,  high  for  the  everlasting  pea,  remarkably  low  for  the  tare  vetch.  It  is 
interesting  to  see  that  the  correlation  between  the  ovules  and  either  ripe  or  aborted 
seeds  is  not  very  high.  A  large  number  of  ovules  not  necessarily  connoting  either 
a  very  large  number  of  ripe  or  of  aborted  seeds.  In  fact,  in  the  case  of  the  ever- 
lasting pea,  the  number  of  ovules  has  very  small  influence  indeed  on  the  number 
of  seeds  which  ripen.  In  the  tare  vetch  only  is  the  relationship  more  marked. 
Professor  F.  O.  Oliver  tells  me  that  in  certain  cases  evolution  appears  to  be  tending 
in  the  direction  of  the  pod  containing  one  ripe  seed  only.  It  seems,  therefore,  that 
such  relations  as  are  indicated  in  Table  LII.  (and  others  of  a  like  kind,  for  further 
observations  ought  certainly  to  be  made)  may  be  useful  in  indicating  the  degree  of 
fixity  between  the  number  of  ovules  and  the  number  of  ripe  or  aborted  seed  which 
are  ultimately  to  be  found  in  the  seed  vessel. 

While  the  ovules  have  the  least  and  the  aborted  seeds  the  greatest  variability,  as 

measured  by  the  coefficient  of  variation,  the  ovules  have  the  most  and  the  aborted 

seeds  the  least  homotypic  correlation.    This  might  at  first  sight  appear  to  be  opposed 

to  the  view  expressed  on  p.  363,  that  there  is  no  relationship  between  the  intensities 

*  It  seems  to  me  that  investigations  of  this  kind  ought  to  be  carried  out  by  those  who  have,  what  I 
unfortunately  have  not,  the  needful  land  for  experimental  investigations, 
t  'Phil.  Trans.,'  A,  vol.  192,  p.  277  et  seq. 
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of  variation  and  of  homotyposis.  But  it  is  really  not  so,  for  if  we  proceed  as  on  that 
page  to  divide  up  our  series  into  two  groups  in  order  of  their  variability,  we  find  that 
the  first  eight  give  an  average  coefficient  of  variation  of  22*68,  and  have  an  average 
homotypic  correlation  of  •1941  ;  the  last  eight  have  an  average  coefficient  of  variation 
of  51*47 — double  and  more  that  of  the  first  eight— while  their  homotypic  correlation 
is  '2207.  Thus,  so  far  from  the  homotyposis  being  reduced  by  greater  variability,  it 
actually  appears  to  be  slightly  increased. 

Table  LIT. — Relationship  between  the  Number  of  Ovules  and  those  of  Ripe  and 

of  Aborted  Seeds. 


Species. 

Correlation. 

Ripe  and  aborted. 

Eipe  and  ovides. 

Aborted  and  ovules. 

Lathyrus  Odorafiis  (i.) .    .  . 

„      (ii.)     .  • 

Lathyrus  Syhestris  .... 

-  -7692 

-  -8290 

-  -8797 

-  -2859 

•4581 
•3628 
•0992 
•6195 

•2155 
•2203 
•3660 
•5751 

Means     .    .  . 

-  -69095 

•3849 

• 

•3442 

Lastly,  we  may  notice  (Table  LI.)  that  the  variation  in  the  individual  amounts 
to  upwards  of  97  per  cent,  of  that  in  the  race,  or  we  have  another  strong  fact  to  aid 
in  demolishing  the  theory  that  variability  is  a  result  of  sexual  reproduction.  Note 
in  particular  the  97  per  cent,  in  the  case  of  the  ovules  whose  number  is  determined 
before  fertilisation ! 

Tables  XXXV.-XLIX.  give  the  data  from  which  the  constants  of  Tables  L.-LII. 
have  been  calculated. 
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PROFESSOE  K.  PEARSON  AND  -OTHERS  ON 


Table  XXXVII. — Latliijrus  Odoratus.    First  Series. 


Number  of  Ovules  in  First  Pod. 


5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

Totals. 

5 

7 

7 

3 

1 

18 

6 

22 

24 

23 

35 

20 

2 

126 

7 

24 

172 

282 

205 

134 

36 

2 

855 

8 

7 

23 

282 

1168 

786 

371 

81 

9 

2727 

9 

7 

35 

205 

786 

1114 

625 

132 

21  1 

2925 

10 

3 

20 

134 

371 

625 

592 

127 

9  1 

1881 

11 

1 

2 

36 

81 

132 

127 

42 

2 

423 

12 

2 

9 

21 

9 

2 

2 

45 

Totals 

18 

126 

855 

2727 

2925 

1881 

423 

45 

9000 

Table  XXXVIII. — Lathyrus  Odoratus.    First  Series. 


Numher  of  liipe  Seeda  in  First  Fod. 


0. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

Totals. 

0 

2 

1 

3 

2 

1 

9 

1 

1 

2 

14 

13 

8 

10 

2 

3 

1 

54 

2 

1 

2 

1 

9 

34 

23 

20 

6 

3 

0 



99 

3 

2 

1 

6 

45 

61 

59 

41 

33 

22 

8 

1 

279 

4 

14 

9 

45 

120 

193 

201 

164 

124 

41 

14 

2 

927 

5 

2 

13 

34 

61 

193 

426 

461 

351 

208 

86 

29 

6 

2 

1872 

6 

1 

8 

23 

59 

201 

461 

564 

418 

251 

111 

48 

3 

3 

2151 

7 

3 

10 

20 

41 

164 

351 

418 

338 

238 

87 

29 

7 

4 

1710 

8 

2 

2 

6 

33 

124 

208 

251 

238 

234 

76 

29 

3 

1206 

9 

1 

3 

3 

22 

41 

86 

111 

87 

76 

30 

15 

2 

477 

10 

1 

0 

8 

14 

29 

48 

29 

29 

15 

4 

3 

180 

11 

1 

2 

6 

3 

7 

3 

2 

3 

27 

12 

2 

3 

4 

9 

Totals 

9 

54 

99  , 

279 

927 

1872 

2151 

1710 

1206 

477 

180 

27 

9 

9000 
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Table  XXXIX, — Lathyrus  Odoratus.    First  Series. 


Number  of  Aborted  Seech  in  First  Pod. 


0. 

1. 

2. 

3. 

4. 

5. 

6. 

8. 

9. 

10. 

Totals. 

0 

78 

151 

167 

129 

92 

35 

12 

2 

666 

1 

151 

286 

388 

296 

225 

104 

37 

14 

1 

0 

1 

1503 

2 

167 

388 

636 

468 

375 

161 

61 

17 

1 

0 

3 

j  2277 

3 

129 

296 

468 

510 

355 

127 

72 

18 

2 

0 

3 

1980 

4 

92 

225 

375 

355 

362 

117 

61 

10 

3 

0 

2 

!  1602 

5 

35 

104 

161 

127 

117 

32 

16 

10 

1 

603 

6 

12 

37 

61 

72 

61 

16 

18 

1 

1 

279 

7 

2 

14 

17 

18 

10 

10 

1 

72 

8 

1 

1 

2 

3 

1 

1 

9 

9 

0 

0 

0 

0 

0 

10 

1 

3 

3 

2 

9 

Totals 

666 

1503 

2277 

1980 

1602 

603 

279 

72 

9 

0 

9 

9000 

Table  XL, — Lathyrus  Odoratus.    Second  Series. 


Number  of  Ovules  in  First  Pod. 


5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

1 

'  Totals. 

5 

2 

17 

24 

14 

6 

63 

6 

17 

62' 

95 

59 

26 

2 

261 

7 

24 

95 

196 

300 

150 

82 

8 

i  855 

8 

14 

59 

300 

1066 

689 

321 

34 

1 

2484 

9 

6 

26 

150 

689 

754 

439 

48 

3 

2115 

10 

2 

82 

321 

439 

394 

37 

3 

i  1278 

11 

8 

34 

48 

37 

6 

2 

135 

12 

1 

3 

3 

2 

9 

Totals 

63 

261 

855 

2484 

2115 

1278 

135 

9 

7200 
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PROFESSOR  K.  PEARPON  AND  OTHERS  ON 


Table  XLI. — Lathyrus  Odoratus.    Second  Series. 


Number  of  Ripe  Seeds  in  First  Pod. 


0. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

Totals. 

0 

2 

3 

2 

10 

13 

11 

3 

6 

3 

1 

54 

1 

3 

2 

8 

13 

19 

31 

11 

12 

15 

1 

1 

1 

117 

2 

2 

8 

22 

50 

44 

68 

45 

38 

29 

16 

2 

0 

324 

3 

10 

13 

50 

62 

85 

105 

67 

70 

46 

17 

4 

2 

531 

4 

13 

19 

44 

85 

108 

178 

149 

129 

65 

20 

5 

819 

5 

11 

31 

68 

105 

178 

300 

257 

251 

146 

42 

6 

0 

1395 

6 

3 

11 

45 

67 

149 

257 

278 

284 

174 

50 

20 

3 

1341 

7 

6 

12 

38 

70 

129 

251 

284 

258 

183 

71 

18 

3 

1323 

8 

3 

15 

29 

46 

65 

146 

174 

183 

118 

59 

23 

864 

9 

1 

1 

16 

17 

20 

42 

50 

71 

59 

36 

10 

\ 

324 

10 

1 

2 

4 

5 

6 

20 

18 

23 

10 

0 

1 

90 

11 

1 

0 

2 

4 

0 

3 

3 

3 

1 

1 

18 

Totals 

54 

117 

324 

531 

819 

1395 

1341 

1323 

864 

324 

90 

18 

7200 

Table  XLII. — Lathyrus  Odoratus.    Second  Series, 


Number  of  Aborted  Seeds  in  First  Pod. 


0. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

10. 

Totals. 

0 

112 

166 

138 

116 

57 

27 

25 

15 

6 

3 

1 

666 

1 

166 

266 

280 

230 

131 

51 

39 

26 

13 

4 

0 

1206 

2 

138 

280 

396 

312 

249 

115 

71 

29 

18 

10 

2 

1620 

3 

116 

230 

312 

278 

237 

106 

75 

18 

23 

6 

3 

1404 

4 

57 

131 

249 

237 

156 

113 

82 

30 

19 

5 

1 

1080 

5 

27 

51 

115 

106 

113 

48 

36 

18 

12 

4 

1 

531 

6 

25 

39 

71 

75 

82 

36 

30 

12 

4 

4 

0 

378 

7 

15 

26 

29 

18 

30 

18 

12 

8 

6 

0 

0 

162 

8 

6 

13 

18 

23 

19 

12 

4 

6 

6 

0 

1 

108 

9 

3 

4 

10 

6 

5 

4 

4 

0 

0 

36 

10 

1 

0 

2 

3 

1 

1 

0 

0 

1 

9 

Totals 

666 

1206 

1620 

1404 

1080 

531 

378 

162 

108 

36 

9 

7200 
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Table  XLIII. — Lathyrus  Sylvestris.    Sea  Coast,  Dorsetshire. 


Xumber  of  Ovules  in  Fird  Foil. 


8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

17. 

Totals. 

8 

2 

3 

3 

0 

1 

9 

9 

0 

0 

0 

0 

0 

0 

10 

2 

9 

7 

0 

0 

0 

18 

11 

5 

10 

28 

,  10 

1 

0 

54 

12 

2 

5 

28 

90 

167 

38 

2 

1 

333 

13 

2 

0 

9 

10 

90 

486 

837 

348 

51 

3 

1836 

U 

3 

0 

7 

28 

167 

837 

2162 

920 

143 

17 

4284 

15 

3 

0 

0 

10 

38 

348 

920 

610 

121 

20 

2070 

16 

0 

0 

0 

1 

2 

51 

143 

121 

30 

3 

351 

17 

1 

» 

0 

0 

1 

3 

17 

20 

3 

0 

45 

Totals 

9 

0 

18 

54 

333 

1836 

4284 

2070 

351 

45 

9000 

Tablje  XLIV. — Lfitliyrus  Sylvestris.    Sea  Coast,  Dorsetshire. 


Xiimber  of  liipe  Seeds  m  First  Pod. 


0. 

1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

Totals. 

0 

1 

1 

1 

0 

1 

1 

4 

9 

1 

4 

4 

16 

12 

11 

7 

7 

1 

1 

63 

2 

4 

16 

10 

40 

33 

37 

29 

5 

6 

0 

180 

3 

1 

4 

10 

42 

146 

170 

135 

65 

53 

20 

12 

6 

2 

666 

4 

1 

16 

40 

146 

412 

378 

299 

197 

86 

42 

16 

13 

1 

1647 

5 

1 

12 

33 

170 

378 

476 

426 

267 

145 

77 

33 

16 

9 

2043 

6 

0 

11 

37 

135 

299 

426 

402 

251 

163 

84 

31 

13 

2 

1854 

7 

1 

7 

29 

65 

197 

267 

251 

168 

107 

59 

19 

15 

3 

1188 

I 

1 

7 

5 

53 

86 

145 

163 

107 

70 

39 

■  24 

10 

1 

711 

4 

1 

6 

20 

42 

77 

84 

59 

39 

38 

12 

5 

387 

10 

1 

0 

12 

16 

33 

31 

19 

24 

12 

2 

3 

153 

11 

6 

13 

16 

13 

15 

10 

5 

3 

81 

12 

2 

1 

9 

2 

3 

1 

18 

Totals 

9 

63 

180 

666 

1647 

2043 

1854 

1188 

711 

387 

153 

81 

18 

9000 
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Table  XLV. — Lathp-us  Sylvestris.    Sea  Coast,  Dorsetshire, 

Nimher  of  Aborted  Seeds  in  First  Pod. 


2. 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

16.1 

Totals. 

2 



1 



6 

— - 

4 

2 

1 

7 

5 

1 

■ 

27 

3 

— 

3 

4 

11 

21 

18 

13 

10 

8 

2 

_ 

90 

4 

1 

3 

Z 

1  Q 

lo 

9  J. 

97 

it  V 

1  X 

o 

0 

0 

1  ; 
J' 

162 

5 

6 

4 

12 

28 

46 

61 

73 

66 

45 

25 

7 

5 

0 

378 

6 

4 

11 

18 

46 

84 

155 

141 

123 

107 

42 

15 

6 

0 

4 

0 

756 

7 

2 

21 

23 

61 

155 

250 

286 

262 

164 

123 

41 

14 

0 

2 

0 

1404 

8 

1 

18 

24 

73 

141 

286 

344 

354 

241 

143 

41 

16 

0 

0 

1 

1683 

9 

7 

13 

27 

66 

123 

262 

354 

396 

305 

195 

55 

22 

0 

0 

2 

1827 

10 

5 

10 

29 

45 

107 

164 

241 

305 

254 

148 

65 

19 

0 

1 

2 

1395 

11 

1 

8 

14 

25 

42 

123 

143 

195 

148 

100 

39 

I 

1 

1 

855 

12 

2 

5 

7 

'  15 

41 

41 

55 

65 

39 

18 

1 

1 

297 

13 

3 

5 

6 

14 

16 

22 

19 

15 

7 

I 

0 

0 

108 

14 

0 

0 

0 

0 

0 

0 

0 

0 

? 

0 

15 

0 

0 

4 

2 

0 

0 

1 

1 

0 

9 

16 

1 

0 

0 

0 

1 

2 

2 

1 

1 

1 

9 

Totals 

27 

90 

162 

378 

756 

1404 

1683 

1827 

1395 

855 

297 

108 

0 

9 

9000 

1 

Table  XL VI, —  Vicia  Faho.    Dauby  Dale,  Yorkshire. 

Number  of  Ripe  Seeds  in  First  Pod. 


0. 

1. 

2. 

3. 

4. 

5. 

Totals. 

0 

46 

38 

35 

120 

76 

315 

1 

38 

46 

70 

142 

89 

387 

2 

35 

70 

274 

608 

386 

4 

1377 

3 

120 

142 

608 

1542 

1237 

23 

3672 

4 

76 

89 

386 

1237 

1332 

3168 

5 

4 

23 

48 

1 

81 

Totals 

315 

387 

1.377 

3672 

3168 

81 

9000 

Table  XLVII. —  Vicia  Hirsuta.    Danby  Dale,  Yorkshire. 

Nurnher  of  Ovules  in  First  Pod. 


1. 

2. 

3. 

4. 

6. 

Totals. 

1 

6 

22 

33 

21 

4 

86 

2 

22 

98 

169 

81 

21 

2 

393 

3 

33 

169 

360 

292 

86 

10 

950 

4 

21 

81 

292 

194 

57 

11 

656 

5 

4 

21 

86 

57 

38 

6 

212 

6 

10 

11 

6 

2 

31 

Totals 

86 

393 

1 

950 

656 

212 

31 

2328 

g 
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Table  XLVIII. —  Vicia  Hirmta.    Danby  Dale,  Yorkshire. 

Number  of  Ripe  Seeds  in  Fird  Pod. 


0. 

1. 

2. 

3. 

4. 

5.  ' 

Totals. 

0 

3 

1 

1 

5 

1 

160 

165 

89 

31 

445 

2 

3 

165 

492 

304 

45 

11 

1020 

3 

1 

89 

304 

278 

38 

6  ; 

716 

4 

1 

31 

45 

38 

10 

120 

5 

11 

6 

22 

Totals 

5 

445 

1020 

716 

120 

22  ! 

1 

2328 

Table  XLIX. — Vicia  Hirsuta.    Danby  Dale,  Yorkshire. 

Number  of  Aborted  Seeds  in  First  Pod. 


0. 

1. 

2. 

3. 

4. 

Totals. 

0 

160 

263 

8S 

9 

6 

526 

1 

263 

788 

176 

52 

10 

1289 

2 

88 

176 

92 

16 

16 

388 

3 

9 

52 

16 

10 

2 

89 

4 

6 

10 

16 

2 

2 

36 

Totals 

526 

1289 

388 

89 

36 

2328 
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XI.  Mathematical  Contributions  to  the  Theory  of  Evolution, — X.  Supplement 

to  a  Memoir  on  Skew  Variation* 

By  Karl  Pearson,  F.R.S.,  University  College,  London. 

Received  May  22,— Read,  June  20,  1901. 

(1.)  In  a  memoir  on  Skew  Variation  published  in  the  'Phil.  Trans.,'  A,  vol.  186, 
1895,  a  series  of  frequency  curves  are  discussed  which  are  integrals  of  the  differential 
equation 

1  dj/  —X  ,.  - 

(See  p.  381  of  the  memoir.) 

The  discussion  of  four  main  types  is  given  in  detail,  and  a  brief  reference  is  made 
to  various  sub-types  which  may  occur.  The  types  considered  in  that  memoir  covered 
at  the  time  all  the  frequency  series,  and  they  were  fairly  numerous,  that  I  had  had 
occasion  to  deal  with.  In  the  course  of  the  last  few  years,  however,  I  have  been 
somewhat  puzzled  by  frequency  distributions  for  which  the  criterion  2y82  —  3y8j  —  6 
{see  p.  378)  M^as  positive,  and  therefore  d priori  a  curve  of  the  type 

1/  —  5/  ^  a—"       ~  '  (•'-■'«) 

was  to  be  expected,  but  which  on  calculation  gave  v  imaginary.  The  frequency 
distributions  in  question  aroset  occasionally  in  sociological  statistics,  but  also  in 

*  'Phil.  Trans.,'  A,  vol.  186,  p.  343. 

t  Some  other  frequency  distributions,  which  on  first  investigation  fell  under  Types  V.  and  VI.  of  the 
present  paper,  were  found  with  improved  values  for  the  moments  to  fall  under  types  already  discussed. 
Mr.  W.  F.  Sheppard's  values  for  the  moments  ('  Lond.  Math.  Soc.,'  vol.  29,  p.  369,  formula  30)  should 
certainly  be  used  in  preference  to  those  given  by  me  ('Phil.  Trans.,'  A,  vol.  186,  p.  350)  whenever  we  are 
calculating  the  moments  of  a  curve  from  areas  and  not  from  true  ordinates.  I  hope  shortly  to  publish  a 
paper  on  this  point,  which  is  one  really  of  quadrature  formulae.  Meanwhile  for  every  true  frequency 
•curve  with  high  contact  at  both  terminals  we  ought  to  use 

/X2  =      (V2'  -  Vi'2  -  J^) 

/X,  =  -  4V1V3'  +  6vi'V  -  3vi'<  -  i(l'2'  -  ''i'2)  +  ^l<j), 

instead  of  the  values  given  on  p.  350,  jj-z  remaining  unchanged. 
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biological  investigations.  It  seemed,  therefore,  desirable  to  enter  a  little  more 
fully  into  the  analysis  of  the  cases  in  which  tlie  criterion  was  positive  but  v 
imaginary,  and  discover  what  types  of  frequency  curves  had  escaped  my  attention.*' 

The  key  to  the  solution  lies  in  the  fact  noted  on  p.  369  of  the  memoir,  namely,  that 
even  if  the  criterion  be  positive,  there  will  still  be  a  solution  akin  to  Type  I.  and  not 
to  Type  IV.  if  e  be  negative.  No  frequency  series  satisfying  these  conditions  had  at 
that  time  come  inider  my  notice,  and  later,  when  collecting  data  of  floral  variability, 
my  own  remark  as  to  e  had  slipped  from  my  memory.  It  is  the  object  of  this 
supplement  to  obtain  an  improved  criterion  of  type,  to  discuss  the  nature  of  the 
curves  which  fill  the  gap  observed,  and  to  illustrate  by  one  or  two  examples  th& 
fitting  of  such  curves  to  actual  statistics. 

(2.)  The  Two  Criteria. 

Throughout  this  supplement  the  notation  of  the  previous  memoir  will  be  assumed 
to  be  familiar  to  the  reader. 

Turning  to  p.  378  of  that  memoir,  we  note  that  since  yS^  and  r  —  I  are  necessarily 
positive,  z  if  positive  must  be  >  r^.  Hence  v  can  only  become  imaginary  if  z  be 
negative,  or 

16  (r  -  1) 

Siibstitute  in  this  the  value  of  r  and  it  becomes 


/3,  (A  +  •^)- 


4(4/3^-  3/3i)(2^„-3^i-  6) 


>1  


Hence  the  complete  condition  that  a  curve  of  Type  IV.  shall  give  the  distribution 
of  frequency  is  not  only 

/fj  =  2;S.  -  3y8i  -  6  >  0, 

but  also 

~      4  (4/32  -  3/3,) (2^3  _  3;Si  -  6)  ^  ^• 

Turning  back  to  p.  369,  we  see  that  e  being  positive  the  complete  conditions  for  a 
curve  of  Type  I.  giving  the  distribution  of  frequency  are 

=  2y8o  -  3/3i  —  6  <  0, 

*  I  was  very  loath  to  adopt  Professor  Edgeworth's  method  of  inventing  new  frequency  curves  hy 
putting  X  =  /(,o')  in  a  normal  frequency  distribution,  y  =  yoe-<^--'\  Besides  strong  theoretical  objections  to. 
this  process,  I  had  found  Equation  (i.)  so  sufficient  for  a  great  variety  of  cases  that  I  felt  confident  it  must 
cover  the  newly  discovered  outstanding  cases,  and  this  confidence  seems  justified  by  the  result. 


CONTRIBUTIONS  TO  THE  THEORY  OF  EVOLUTION. 


445 


and 

^{4/3, -3^,)  (2^,-3/3,-6)  ^  ' 

The  latter  condition  will  be  always  satisfied  since  /Sj  and  4^^  —  3^^  are  positive  for 
any  distribution  whatever,  and  2/3o  —        —  (3  is  negative  by  hypothesis. 
Further,  in  the  previous  case     is  seen  to  be  essentially  positive. 
Hence  the  criteria  written  down  cover  all  possible  cases  but  those  for  which 

K,  >  1. 

Sub-cases  which  arise  from  transition  curves  just  at  the  limits  will,  however,  be 
hkely  to  be  of  interest,  What  happens  when  k.,  =  co  and  when  =  1  ?  The  only 
possibility  for  /<„  =  oo  is  2^.  —  S/Sj  —  6,  or  =  0.  But  this  curve  has  been  fully 
treated  under  Type  III,  in  the  memoir. 

We  shall  see  later  that  k.^  =  1  leads  us  up  to  a  novel  transition  curve  of  consider- 
able interest. 

To  ascertain  something  about  the  general  case  in  which  /Cg  >  I,  let  us  return  to 
the  memoir  again  and  examine  the  value  of  e  on  p.  369.    It  can  only  be  negative  if 

4  +  i;8i(v  +  2)V(r-t-  1)  be   <  0, 

where  r  is  here  _  6(^-^.-1) 

where  7  is  here  "  3 /S^  -  2 /3,  +  6  *  ^ 


Substituting,  we  find  at  once 


«2  >  1, 


which  in  itself  involves     >  0. 

Hence  the  missing  gap  corresponds  to  those  cases  in  which  e  is  negative. 

It  will  be  clear  that  Ko,  although  in  form  giving  a  more  complex  criterion  than  k^, 
is  really  more  efi'ective,  as  covering  all  the  possible  cases.  We  have  then  the 
following  scheme  : — 


Criterion  ko. 

Corresponding  frequency  curve. 

K2  >  1  &  <  00      .     .     .  . 

'<2  =  1  

K2  >  0  &  <  1  

Kg  =  0,  ^1  =  0,  y8,  =  3.  . 
/C2  =  0,  ;8i  =  0,^2  not  =  3 
'^2  <  0  

Transition  curve,  Type  III.  (Memoir,  p.  373). 

Type  VI.  (see  p.  448  below). 

Transition  curve,  Type  V.  (see  p.  446  below). 

Type  IV.  (Memoir,  p.  376). 

Normal  curve. 

Type  II.  (Memoir,  p.  372). 

Type  I.  (Memoir,  p  367). 
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The  object  of  this  supplement  is  to  discuss  the  calculation  of  curves  of  Type  V., 
and  to  consider  those  of  Type  VI.  somewhat  more  at  length,  they  being  only  briefly 
referred  to  on  p.  369  of  the  memoir.  It  will  be  seen  that  Type  I.  of  the  memoir  has 
now  broken  up  into  two  divisions.  One  portion  is  the  old  Type  I.  passing  into  the 
normal  curve  on  one  side  and  Type  III.  on  the  other.  This  Type  III.  separates  the 
second  portion,  Type  VI.,  of  the  old  Type  I.  from  the  first  portion.  Type  VI.  passes 
from  Type  III.  to  the  new  transition  curve  Type  V.,  whicli,  hke  Type  III.,  will  be 
found  to  have  a  range  limited  in  one  direction  only.  Finally  this  new  Type  V.  is  the 
transition  to  the  old  Type  IV.  bounded  on  the  other  side  by  the  sub-curve,  the  old  ^IW* 
Type  11. ,  and  beyond  that  the  normal  curve.  Thus  we  see  that  Types  I.  and  IV.  do 
not  pass  directly  into  each  other  through  Type  III.,  as  might  be  supposed  by  the 
criterion  K|  >  or  <  0,  but  that  there  are  a  series  of  intervening  curves,  two  of  which, 
Types  V.  and  VI.,  require  further  consideration,  if  we  are  to  complete  the  whole 
round  of  frequency  distributions  embraced  under  the  differential  equation  (i.). 

(3.)  On  the  Frequency  Cui've  of  Type  V. 

Returning  to  the  fundamental  differential  equation  (i.),  let  us  consider  what 
transformation  takes  place  when  the  denominator  on  the  right  has  equal  roots.*  We 
may  then  write  it  in  the  form 

Idy  —  a;  Cj       1  1 

ydx"  Cq{c^  +  xf  ~~  Co  (cj  +  xf      c,,  (c^  +  x)  ' 

c       1  1 

Hence  log  y  =  -  7  TT^ZV^  ~  r  +  ^)  +  const. 

Thus  2/  =  2/o  e  ""^^  (ci  +  x)-P, 

where,  y^  is  a  constant,  y  =  cjc^^  and  p  =  l/c\y    Thus  changing  the  origin  we  may 
write  the  curve  : 

y  =     x-P  e-y^'^  (iii-)' 

where  x^q 

=  y/P  gives  the  distance  of  the  mode  from  the  new  origin. 

To  find  the  moments  about  this  origin,  we  notice  that,  p  and  y  being  positive, 
y  =  0  when  x  =  0  and  when  x  =  co  .  Thus  as  in  the  curve  of  Type  III.  we  have  a 
range  limited  at  one  end  only. 

To  find  the  moments  we  have,  if  a  be  the  area, 

aja'„=  I  y^x'P"^"  e"'''  dx  (iv.). 

J  0 

*  I  owe  to  Miss  Agnes  Kelly,  Ph.D.,  the  suggestion  that  this  type  of  frequency  curve  deserved  fuller 

treatment. 


Put 


Thus 
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yjx  =  z,  and  we  find 

J  0 

=  2/o7"~^'^^  r  (p— 1)   (v.). 

azzzy^y^-J'Tip-  1)   (vi.)- 


i^i'  =  r/(p-2) 

/^3'  =  yV{^-2)  (p-3)(p-4) 

;x;  =  yV  {2)  -  2)  (/>  -  3)  {p-i){p-  5) 


(vii.). 


Transferring  to  the  centroid  we  find 

T 


H-2 


(^p'^^2)^p  -3)(p  -  4) 

 Sip  +  4)7* 

^■^  ~    -2y(p-  3) 0^  -  4)(^  -  5) ; 

16  {p  -  3) 
(i'  -  4)2  • 

30; +  4)  (i?-3) 


(viii.). 


^1  =  l^^lf-^  = 


0^  -  5)     -  4)     •    •    •  • 

Eliminating'    between      and  /S^  we  find  after  some  reductions  : 
^1      +  3)^  =  4  (2^82  -       -  6)  (4^3  -  3^) 


(ix.). 


or, 


(xi.). 


Clearly,  since  this  is  the  condition  for  Type  V.,  that  transition  curve  is  none  other 
than  the  curve  obtained  by  making  the  denominator  of  the  right-hand  side  of  the 
differential  equation  have  equal  roots.  The  curve  is  clearly  of  considerable  interest, 
and  its  existence  had  not  been  noticed  in  the  previous  series  of  frequency  curves. 

The  manner  of  fitting  it  is  now  easily  described. 

Equation  (ix.)  gives  us  a  quadratic  to  find  p  —  4  : 


(,,_4)-^(^_4)-^=0  . 


The  positive  root  of  this  is  the  required  solution. 
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y  is  then  found  from  the  first  of  equations  (viii.),  or  if  a  be  the  standard  deviation 

=  ^/fio,  then   

y  =  a-{p-2).  s/{p-S)  (xiii.V 

Then  (vi.)  gives  :  ^ 

?/o=p^ri)  (^iv.), 

which  determines      the  remaining  constant  for  the  shape  of  the  curve. 

For  the  position  of  the  curve,  we  have  for  the  distance  from  origin  to  mean,  from 
the  first  of  equations  (vii.)  : 

f,/=y/{p-2)  =  <r^/{p-S)   (XV.). 

If  d  be  the  distance  from  mode  to  mean  we  have  : 

d  =  fi,'  -  y/p 

7         7           27  . 
=   —     =   ^  (xvi.). 

Further,  the  skewness : 

Sk.  =  c//o-  =  ^^^^"^^   -  .  (xvii.). 

Thus  the  solution  is  completed. 


(4.)  On  the  Frequency  Curve  of  Type  VT. 

Type  VI.,  as  we  have  seen,  corresponds  to  the  case  in  which  Type  I.  of  the  memoir 
has  its  e  negative.  Hence  either  or  m.^'  is  negative  and  the  curve  transfening  the 
origin  takes  the  form 

y  =  yQ{x  —  ay'/x'""  (xviii.). 

Now  it  is  possible  that  this  curve  falls  under  the  limited  range  type  of  a  frequency 
from  ic  =  0  to  A'  =  a,  but  as  we  see  that  the  criterion  places  Type  VI.  between  tAvo 
CTirves  of  range  limited  in  one  direction  only,  we  expect  Type  VI.  also  to  be  of  that 
character,  and  a  complete  solution  is  obtained  by  taking  the  range  from  x  =  a  to 
X  =  00  ;  this  indeed  fills  up  the  gap  for  /c^  >  1  and  <  00  ,  and  (xviii.)  with  this  range 
is  seen  to  pass  into  one  or  other  of  the  two  transition  curves 

2/  =  2/0  xJ'e-y\ 

or  y  =  y^x-Pe-y\ 

according  as  we  allow  the  first  or  second  factor  to  approach  a  limit,  t 

■*  The  sign  of  /X3  Avill  determine  the  sign  of  y,  or,  what  may  be  taken  as  the  same  thing,  the  direction 
of  the  axis  of  x. 

t  Write :  y  =  const,  x         (1  -  xja)"'i-  and  make  nh  =  00,  and  ft  =  00  but  niija  finite. 
Or,  y  =  const,  x  (1  -  ft/.r)"'i/a;"'a-»ii,  and  make  a  —  0,  Wi  =  00,  and  a  x  m■^  together  with  )ih2  -  nii 
finite. 
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(xix.), 


and  take  the  range  from  ce  to  oo . 
Differentiating  to  find  the  position  of  the  mode  we  have 


9i  -  % 


For  the  moments  about  the  origin  : 

af'..  =  )^  2/0 -^JT— 

Put  ajx  =  hence 

=  .  B  (q,  -q.-n-^\,q,  +  1) 

_     yo     r(gt  1)  r(g,  +  i) 


Hence  we  deduce 


a  = 


= 


/*3  = 


^4  = 


a3i-9a-l 

a(^i  -  1) 


yo    rfa->z2-i)r(gc,  +  i) 


?i  -  &  -  2 

(?i  -  ?3  -  2)      -  !Z3  -  3J 

^^(gi-l)(gi-2)(gi-3) 
(!Zi  -  ?2  -  2)(!Zi  -  (Z2  -         -  ??2  -  4) 

a'^(g,-l)(gi-2)fa-?,)(g,-4) 
(!Zi  -  ?3  -  2)  {ix  3)(?i  -  ?2  -  4)('Zi  -  22  -  5)J 


(XX.). 


(xxi.). 


(xxii.). 


Now  if  we  compare  these  results  with  those  on  p.  368  of  the  earlier  memoir  we  see 
that  the  one  set  can  be  at  once  deduced  from  the  other  by  writing  =  —  q^,  =  g'^. 
Thus  with  this  interchange  the  whole  of  that  solution  holds,  if  we  bear  in  mind  that 
the  range  is  now  from  ,t  =  a  to  oo  . 

We  easily  find  ; 


VOL.  CXCVII. — A. 


-  <Zi  +     +  2  €  =  1  -  (?,  +  ^2  -  q^q^ 
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and  1  —  7 1  and  q.^ -\-  I  are  the  roots  of 

z^  —  rz-{-e=0  (xxiii.), 

where  r  and  e  are  to  be  determined  as  in  that  memoir,  pp.  368-369. 
We  have  : 

/^3  =   +  1)  (XXIV.), 

where  1  —  and  r  are  both  negative.  This  gives  a*  Thns  q^,  and  a  are  known, 
and  from  Equation  (xxi.) 

we  find  the  remaining  unknown  constant  for  tlie  shape  of  tlie  curve,  y^.  As  before, 
various  approximations  may  be  used  to  the  vahies  of  the  r  functions  when  either  q^ 
or     or  both  are  large,  t 

We  easily  obtain  for  the  distance  between  mode  and  mean 


and  for  the  skewness 


,7  ^^(gi  +  '?^)   (xxvi  ) 


 (li  +  g.)v/(li-g2-3)^  ,      .J . 


(5.)  A  special  case  of  some  interest  arises  when  the  start  of  the  curve  is  d  priori 
known.  Suppose  its  distance  from  the  mean  to  be  c  and  let  (using  moments  about 
centroid) 

i^i/c^  =  y2>    /^aA^/u-sc)  =  73  (xxviii.). 

Then  we  easily  find  : 

 1  -  !Zi   _  gi  +  % 

(l+!/.)(-gi+'/2  +  3)'  0-  +  g.)(gi  -  go  -  4)  • 


*  1  -     being  negative,  e  is  negative,  and  accordingly  by  what  goes  before     lies  between  1  and  oo  . 
t  The  value  of  yn  for  curves  of  Type  I.,  if  mi  be  small  but      large  ('Phil.  Trans.,'  A,  vol.  186,  p.  369, 
foot-note),  is 

o.  I      ,        ,  ^^       /??«!  +  m--,  -A-(— '  L) 

?/o  =  ^       +  m,  +  1)  y  ,12      H- +  1) ' 

and  this  can  be  easily  modified  to  suit  (xxv.)  above.  A  very  convenient  and  exact  formula  for  F  (n  +1), 
if  n  be  large,  is  that  given  by  Forsyth  ('B.A.  Report,'  1883,  p.  47) : 

r(.  +  1)  =  J2^lj^^>^'^^r\n  +  h 

the  error  beino;  less  than  — - —  of  the  whole, 
°  240ft2 
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Whence  we  deduce  to  determine     and  : 

1  -  +  47.;  ,  73(1  +  73)(73  -  1  -  -^73)  r        '  \ 

72  -  73  (^72-73  +  7373)(73-73) 

and  the  solution  proceeds  as  before. 

(6.)  Illustrations. — I  propose  to  note  a  few  distributions  of  frequency  in  which  I 
have  come  across  Types  V.  and  VI. 

(A.)  Statistics  of  Age  of  Bride  at  Marriage,  the  Bridegroorri  s  Age  being  betiveen 

24  and  25  years.'^' 

The  observations  given  in  the  table,  p.  454,  are  taken  from  Perozzo's  memoir  : 
"  Nuove  Applicazioni  del  Calcolo  delle  Probabihta  Reale  Accademia  dei 

Lincei;  Anno  CCLXXIX.,  1881-2,  Tavola  1. 

The  total  number  of  recorded  marriages  is  28,454.  The  moments  were  calculated 
by  using  Sheppard's  corrections  ('  London  Math.  Soc.  Proc.,'  vol.  29,  p.  3G9),  and  are 
as  follows  : — 

Mean  age  of  bride  =  22-1877. 

fi^  z=  13-3346 
/X3=  67-8145 
IM^  =  1224-6342 

Whence:                                  0^=  1-9396 

/3.^=  6-8873 

=  1-9558 

K.=  1-1094 


Thus  by  p.  445  we  see  that  Type  VI.  is  the  frequency  curve  to  be  selected,  but  as 
/C3  does  not  differ  widely  from  unity,  we  shall  probably  get  a  good  fit  from  Type  V. 
as  well. 

Taking  Type  VI.  first,  we  find  : 

r=-  12-11075,  €=-  317-84987. 

The  quadratic  (xxiii.)  is  accordingly  :  ' 

+  12-11075  z  —  317-84987  =  0. 

*  I  selected  this  example  at  random,  as  one  out  of  several  leading  to  the  curve  types  it  was  my 
object  to  illustrate.  There  is  so  much  tampering  with  statistics,  however,  whenever  they  refer  to  the  ages 
of  women,  that  it  would  probably  have  been  better  to  have  used  the  men. 

3  M  2 
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Thus :  q,  =  25-88401,  q.2  =  11-77326. 

Hence  by  (xxiv.  )  a  =  8-268,405, 

andby(xxv.)  log  ^/^  =  24-275,3032. 

We  have  accordingly  for  the  equation  to  the  cvu've  : 

,/=  10=*  XI -884,965  (^-^^2^  . 

The  distance  from  the  origin  to  the  mean  is  given  by  the  first  equation  of  (xxii.) : 

ix^'  =  16-98913, 

or,  the  theoretical  range  starts  with  brides  of  5*198,570  +  8-268,405  =  13*466,975 
years.  This  is  an  excellent  underlimit  to  the  age  of  women  marrying  men  of  24  to 
25  in  a  country  like  Italy.  Our  first  group  is  at  15 '5,  and  the  above  start  is  just  two 
base  units  before  this  initial  group. 

The  skewness  =  *498,953,  and  the  distance  from  mode  to  mean  =  1*822,004,  or 
the  mode  is  at  20*3657  years. 

Turning  now  to  Type  V.  we  have  the  following  results  : — 

16/)8i  =  8-249,262. 

Hence  Equation  (xii.)  is  : 

(p  -  4)2  _  8*249,262  (p  -  4)  -  8*249,262  =  0. 
Thus  the  positive  value  of  ^  is  : 

_p  =  13-150,747. 

Equation  (xiii.)  gives  : 

7=  129-73081. 

Then  (xiv.)  gives  : 

log  yQ  =  22-367,6952. 

Thus  the  equation  to  the  curve  is  : 

y  =  10^3  X  2-331,821  a;-i3-i5o,747  e-israi/,.^ 
To  find  the  position  of  its  start  we  have  by  (xv.)  : 

/t/=  11-6343, 

or,  since  the  mean  age  of  brides  is  22-1877,  the  youngest  possible  theoretical  bride  is 
10-5534  years.  This  is  probably  a  worse  determination  of  the  underlimit  than  in  the 
case  of  Type  VI.    At  the  same  time  I  notice  that  out  of  about  180,000  women,  101 
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were  married  between  14  and  15  years  of  age,  and  all  the  curves  begin  with  a 
sensibly  finite  ordinate  at  14-5  ;  it  is  accordingly  possible  that  a  somewhat  lower  age 
than  13 '5  actually  occurs  in  Italy. 

Equation  (xvi.)  gives  us  for  the  distance  from  mode  to  mean  : 

(/=  1-7694, 

or  the  modal  age  at  marriage  is  20-4183  years.  This  is  only  aljout  "053  of  a  year  or 
about  19  days  different  from  the  modal  age  as  given  Ijy  Type  VI.,  a  most  satisfactory 
agreement. 

For  the  skewness  we  have  from  Equation  (xvii.)  : 

Sk.  =  -4845, 

or,  it  differs  by  less  than  3  per  cent,  from  the  skewness  as  given  by  Type  (VI.). 

The  diagram  (fig.  i.)  shows  the  two  curves,  and  the  table  compares  the  results 
obtained  from  either  with  the  observations.* 
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A§e  of  Bride. 


Type  V. 


Type  VI. 


It  is  clear  that  for  all  practical  purposes  the  curve  of  Type  V.  is  as  good  as  that 
of  Type  VI.    Indeed,  there  is  practically  no  difference  between  them  except  for  the 

*  The  observation  data  are  really  areas,  while  to  save  lengthy  calculations  we  have  compared  both  in 
diagram  and  table  the  ordinates  of  the  theoretical  curves.  This  is  in  general  legitimate,  if,  as  in  this 
case,  the  number  of  groups  is  very  large. 
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ages  15  to  17.  The  fit  is,  however,  not  a  very  good  one,  and  although  it  is 
indefinitely  better  than  a  normal  curve,  and  we  see  why  in  the  absence  of  these 
types  the  statistics  could  not  be  fitted  with  any  of  the  first  series  of  skew  curves, 
yet  we  are  compelled  to  consider  that  there  are  causes  other  than  chance  at  work 
very  definitely  affecting  the  frequency  of  the  recorded  ages.  Thus  the  bridegrooms 
being  24  to  25,  the  desire  of  the  bride  to  be  recorded  as  younger  than  her  husband 
probably  fully  accounts  for  the  bulk  of  the  preponderance  of  observation  over  theory 


Table  of  Observed  and  Calculated  Frequencies. 


Age. 

Observed 
frequency. 

Calculated  frequency. 

Age. 

Observed 
frequency. 

Calculated  frequency. 

Type  V. 

Type  VI. 

Type  V. 

Type  VI. 

15-16 

367 

70 

49 

30-31 

256 

281 

282 

16-17 

717 

514 

489 

31-32 

164 

201 

198 

17-18 

1294 

1538 

1560 

32-33 

134 

148 

146 

18-19 

2121 

2751 

2800 

33-34 

94 

104 

105 

19-20 

3156 

3591 

3622 

34-35 

77 

75 

76 

20-21 

4009 

3830 

3831 

35-36 

68 

55 

55 

21-22 

3593 

3577 

3560 

36-37 

59 

40 

40 

22-23 

3604 

3055 

3034 

37-38 

33 

29 

29 

23-24 

3060 

2456 

2439 

38-39 

40 

21 

22 

24-25 

1774 

1894 

1884 

39-40 

27 

16 

16 

25-26 

1353 

1419 

1415 

40-41 

18 

12 

12 

26-27 

936 

1044 

1043 

41-42 

21 

9 

9 

27-28 

663 

758 

760 

42-43 

11 

7 

7 

28-29 

468 

546 

549 

43-44 

14 

5 

5 

29-30 

319 

392 

395 

44-45 

4 

4 

in  the  frequency  of  the  brides  of  22  to  24.  The  defect  of  brides  between  17  and 
20  may  be  again  due  to  the  tendency  to  state  the  age  as  over  21,  and  so  free 
the  woman  from  the  need  for  parental  sanction.*^  These  causes,  giving  a  false 
displacement  of  age  frequency,  are  probably  in  themselves  sufficient  to  account  for 
the  theoretical  defect  in  brides  of  15  to  17. 


(7.)  (B.)  071  the  Variation  in  the  Number  of  Lips  of  the  Medusa  P.  Pentata. 

My  data  are  the  following,  taken  from  a  paper  by  Alfred  Goldsborough  Mayer  : 
"  The  Variations  of  a  Newly  Arisen  Species  of  Medusa,"  '  Science  Bulletin  of  the 
Museum  of  the  Brooklyn  Institute,'  vol.  1,  p.  1,  1901. 


*  I  have  found  in  England  the  statement  of  the  In'ide's  age  in  the  marriage  licence  is  for  the  same 
reason  occasionally  not  in  accordance  with  the  year  of  birth  as  shown  by  the  parish  register. 
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Total 


Frequency. 

No.  of  lips 

2 

1 

5 

2 

18 

3 

123 

4 

798 

5 

49 

6 

1 

7  " 

996 

Mr.  Mayee  (p.  12)  notes  the  failure  of  my  curve  of  Type  IV.  I  find  for  the 
constants  : 

Mean  =  4-8685  lips. 

fjL.^  =  -309,006,  o-  =  -55588 

=  —  -350,697 
/x^  =  1-181,718 
y8i  =  4-16834 

=  12-37598 
Ki  =  6-24694 
Ko=  1-06594 

Since     is  so  nearly  unity  we  may  use  Type  V. 
Hence  I  find  : 

p=  8-66184  y=-  8-811634 

(y  must  be  negative  since  jixg  is  negative) 

/a/  =  1-32270. 

Thus  the  curve  starts  at  6-19118  lips,  or  the  one  medusa  with  seven  lips  is 
theoretically  excluded.  Here  I  have  worked  with  the  uncorrected  moments 
because  the  lips  are  discontinuous  variants.  Working  with  Sheppard's  corrective 
terms  the  limit  is  about  six  lips,  and  with  the  corrective  terms  suggested  in  my 
memoir  on  skew  variation  the  limit  is  7-65.    Further  we  have  : 

log     =  6-829,3633, 

distance  from  mean  to  mode  =  -30541, 

Sk.  =  -54941. 

The  mode  is  thus  at  5-17389,  in  good  agreement  with  observation. 
The  equation  to  the  curve  is,  taking  x  positive  from  6-19118  lips  towards  lesser 
values ; 

^'826  84^5 

log  y  —  6-829,3633  —  8-66184  log  x  —   7-^-. 
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This  curve  was  drawn  on  a  large  scale  and  its  areas  read  off  with  an  integrator. 
The  following  theoretical  frequencies  were  obtained  : 


No.  of  lips. 

Observation. 

Calculation. 

6  and  over 

50 

47 

5 

798 

762 

4 

123 

160-5 

3 

18 

20 

2 

5 

5 

1 

2 

1-5 

There  would  not  be  any  serious  divergence  here,  w^ere  it  not  for  the  group  with 
four  lips,  which  observation  shows  to  be  much  under-represented.  But  it  must  be 
remembered  that  we  have  only  seven  groups,  and  that  such  a  number  is  very 
insufficient  for  a  good  determination  of  the  moments  of  a  curve.  Further,  the 
variation  is  not  really  continuous,  as  indicated  by  the  curve,  but  discrete.  We  have 
at  present  no' clear  statement  as  to  how  the  moments  of  a  discrete  system  of  variation 
should  be  modified  or  corrected  so  as  to  give  the  best  results  for  the  moments  of  the 
continuous  curve  which  is  to  theoretically  represent  the  series.    I  am  doubtful 
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whether  Sheppard's  corrections — the  best  for  continuous  variation — are  equally 
appropriate  in  this  case.    Above  I  have  used  merely  the  rough  moments,  but  I 
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have  found  by  considerable  experience  that  in  the  case  of  discrete  variables,  to  treat 
the  system  as  a  polygon  and  correct,  as  in  my  memoir  on  Skew  Variation  ('  Phil. 
Trans.,'  A,  vol.  186,  p.  350),  appears  to  give  the  best  results  when  the  areas  are 
compared  with  the  discrete  groups.  The  point  wants  further  investigation  ;  when 
we  have  a  large  number  of  groups  it  is  of  little  importance,  but  it  makes  a  consider- 
able difference  in  these  excessively  skew  distributions  of  discrete  variables  when  the 
number  of  groups  are  small.* 

Above  all,  the  diagram  (fig.  ii.)  shows  how  all  important  it  is  to  compare  areas  and 
not  merely  the  ordinafes  of  the  frequency  curve  with  the  blocks  representing  the 
discrete  frequencies  in  such  a  case  as  this.  The  wide-spread  custom  among  foreign 
investigators  of  comparing  merely  the  ordinates  of  the  theoretical  frequency  curve 
with  the  observed  frequencies  leads  in  such  cases  to  most  fallacious  results. 

(8.)  (C.)  On  the  Distribution  of  Incide7ice  of  Scarlet  Fever  Cases  loith  Age. 

It  seems  desirable  to  give  an  illustration  of  the  method  of  dealing  with  a  distri- 
bution which  falls  under  the  class  dealt  with  in  Section  (5)  of  this  paper.  Dr. 
Macdonell,  in  dealing  with  the  intensity  of  incidence  of  different  diseases  at  various 
ages,  has  come  across  in  scarlet  fever  a  good  illustration  of  curves  of  the  types  now 
under  consideration.  The  whole  of  the  arithmetical  work  on  the  present  example  is 
due  to  him,  and  I  have  to  thank  him  very  heartily  for  allowing  me  to  use  it  here. 

The  statistics  are  taken  from  the  '  Report  of  the  Metropolitan  Asylums  Board ' 
(Statistical  Part,  1899).    They  involve  39,253  male  cases,  distributed  as  foUoAvs  : — 


Year  of  life. 

Frequency. 

Year  of  life. 

Frequency. 

Under  1 

443 

20-25 

926 

1-2 

1456 

25-30 

420 

2-3 

2631 

30-35 

215 

3-4 

3599 

35-40 

91 

4-5 

3862 

40-45 

45 

5-10 

15791 

45-50 

26 

10-15 

7359 

50-55 

17 

15-20 

2366 

55-60 

5 

60-65 

1 

The  data  being  grouped  partly  in  one  and  partly  in  five-year  periods  the  moments 
had  to  be  calculated  with  caution,  separating  the  material  into  two  pieces.  Taking 
five  years  as  the  unit,  Dr.  Macdonell  found  for  the  uncorrected  moments  : 


*  E.g.,  petals  of  buttercups,  teeth  on  the  carapace  of  prawns,  lips  of  medusse,  as  compared  with  veins  on 
chestnut  leaves,  florets  on  ox-eyed  daisy,  &c. 

VOL.  CXCVII.  A.  3  N 


458 


PROFESSOR  K.  PEARSON  ON  MATHEMATICAL 


Mean  age  of  incidence,  8 '60975  years. 
IX.  =  1-369,345 
=  3-233,194 
=  19-143,575. 

The  moments  were  not  modified  by  Shbppard's  corrections,  for  these  suppose  contact 
of  a  high  order  at  both  terminals  of  the  curve,  and  it  was  quite  apparent  that  the 
curve  must  rise  at  a  finite  angle  on  the  l^irth  side.  The  following  additional  constants 
were  then  determined  : — 

=  4-071,222.  /3.2  =  10-209,333, 

=  2-205,000.  K,=  2-813,783. 

Thus  /f.j  is  >  1  and  <oo  and  the  distribution  is  of  Type  VI.    Now  let  us  suppose  the 
incidence  of  scarlet  fever  to  start  with  birth,  although  there  might,  as  in  the  case  of 
enteric  fever,  be  really  some  antenatal  cases.* 
Turning  to  Section  (5)  we  have  : 

c  =  distance  from  birth  to  mean  =  8*60975  years  =  1*72195  units. 

Hence  we  deduce 

yo  =  -461,819,  73  =  -685,596. 

And  so  from  (xxix.) 

q^-.q^=  —  10-532,485,  q^-\-q^  =  15*417,281  ; 

or,  ^1  =  12-974,883,  go  =  2-442,398. 

Then  from 

c  =  ^/  -  a  =  a  ((?2+  l)/(gi  -  ?2  -  2) 

we  find 

a  =  4-268,104, 

and,  finally,  after  determining     from  (xxi.), 

log^o  =  13*652,5078. 
Thus  the  values  of  the  frequency  are  given  by 

log^=  13-652,5078  +  2-442,398  log  (£c  — 4-268,104)  -  12*974,883  logo;. 
The  origin  of  the  curve  is  thus  4-268,104  before  birth.    The  mode  is  given  by 

a^mo  =  coqAqi-qd  =  5-257,842. 
Thus  :  x^^  —  a  - -  '989,738  =  4-94869  yrs. 

*  See  '  Phil.  Trans.,'  A,  vol.  186,  p.  390.  The  remarkably  sharp  rise  of  the  scarlet-fever  distribution 
as  compared  with  the  enteric  is,  however,  much  against  this. 


CONTRIBUTIONS  TO  THE  THEORY  OF  EVOLUTION. 


459 


This  gives  for  y.noae  the  value  3892. 

Distance  between  mode  and  mean  =  3*66106  yrs. 

Whence  we  find  for  skewness  the  value 

Sk.  =  -5347. 

The  diagram  (fig.  iii.)  shows  that  the  fit  may  be  considered  a  good  one. 
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(9.)  The  conclusions  of  this  paper  are,  I  think,  of  some  interest  from  the  general 
standpoint  of  scientific  investigation.  A  certain  number  of  frequency  distributions 
had  been  found,  not  only  by  my  co-workers  and  myself  here,  but  by  biologists  in 
America,  not  to  fit  into  the  general  system  of  skew  distributions  dealt  with  by  me  in 
my  original  memoir.  The  first  conclusion  was  that  however  wide-reaching  that 
system  appeared  to  be,  it  was  a  failure  for  a  few  remarkably  skew  distributions.  But 
on  more  careful  investigation  of  the  differential  equation  it  appeared  that  two  types 
of  solution  had  been  left  out  of  consideration,  and  that  these  were  precisely  those 
needed  in  the  recorded  cases  of  failure. 

I  owe  some  apology  to  authors  like  Professor  Davenport  and  Dr.  Duncker,  who 
have  recently  issued  text-books  on  the  application  of  statistical  methods  to  biological 
variation,  because  although  we  have  known  and  used  these  curves  for  some  years  past, 
no  account  has  hitherto  been  published  of  them,  and,  consequently,  biological 
investigators*  using  their  resumes  of  my  methods  have  been,  and  I  fear  still  may  be, 
occasionally  puzzled. 

*  E.g.,  Mr.  A.  G.  Mayer  in  the  paper  on  Medusse  referred  to  above. 
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(1.)  Introductory.    On  the  Influence  of  Selection  upon  Correlatioi. 

At  an  earlier  stage  in  the  develoj^ment  of  the  statistical  theory  of  evolution  it 
was  suggested  that  the  coefficient  of  correlation  (Galton's  function)  might  be  found 
constant  for  all  races  of  the  same  species— in  fact,  it  was  considered  possible  that  this 
coefficient  might  be  the  long-sought-for  criterion  of  identity  in  species.  Professor 
Weldon,  following  up  this  suggestion  of  Mr.  Galton's,  then  made  the  elaborate 
series  of  measurements  on  crabs  with  which  his  name  will  always  l)e  closely  associated. 
To  a  first  approximation  these  researches  seemed  to  confirm  the  possibility  of 
Galton's  function  being  a  true   criterion   of  species.    When,  however,  a  finer 
mathematical  test  was  applied  to  Professor  Weldon's  observations  as  well  as  to 
other  statistical  series  for  organs  in  man,^'  it  became  clear  that  tlie  coefficient  of 
correlation  varied  from  local  race  to  local  race,  and  could  not  be  used  as  a  criterion 
of  species.    A  slight  investigation  undertaken  in  the  summer  of  1896  convinced  me 
that  the  coefficient  of  correlation  between  any  two  organs,  is  just  as  much  peculiar  and 
characteristic  of  a  local  race  as  the  means  and  variations  of  those  organs.    In  fact, 
if  local  races  be  the  outcome  of  natural  selection,  then  their  coefficients  of  correlation 
must  in  general  differ.    The  object  of  the  present  paper  is  to  show,  not  only  that 
natural  selection  must  determine  the  amount  of  correlation,  but  that  it  is  probably 
the  chief  factor  in  the  production  of  correlation.    If  selection,  natural  or  artificial, 
be  capable  of  producing  correlation,  then  it  seems  impossible  to  regard  all  correlation 
as  evidence  of  a  causal  nexus,  t  although  the  converse  proposition  that  all  causal 
nexus  denotes  correlation,  is  undoubtedly  the  most  philosojDhical  method  of  regarding 
causality. 

In  dealing  with  the  influence  of  selection  on  correlation,  I  shall  suppose  the  distri- 
bution of  complex  groups  of  organs  to  follow  the  normal  correlation  surface — the 
generahsed  Gaussian  law  of  frequency.  I  shall  further  assume  the  selection  surfaces 
to  be  iiormal  in  character.  Neither  of  these  assumptions  is  absolutely  true,  but  the 
Gaussian  law  in  a  good  many  cases  describes  the  frequency  sufficiently  closely  to 
enable  us  to  obtain  fair  numerical  results  by  its  application.  Probably  in  all  cases  it 
will  enable  us  to  reach  qualitative  if  not  accurate  quantitative  theoretical  deductions. 
I  have  the  less  hesitation  in  asserting  this,  as  Mr,  G.  U.  Yule  has  recently  succeeded 
in  deducing  the  chief  formulse  for  correlation  and  regression  as  given  by  the 
Gaussian  law  from  general  principles,  which  make  no  appeal  to  a  sjjecial  law  of 
frequency.^ 

*  "  Mathematicul  Contributions  to  the  Theory  of  Evolution.— IIL,"  'Phil.  Trans.,'  A,  vol.  189,  pp.  266 
and  280.  See  also  E.  Warren,  "Variation  in  Poriwnus  depurator"  'Roy.  Soc.  Proc.,'  vol.  60, 
pp.  233-4. 

T  See  a  series  of  letters  in  '  Nature,' vol.  54,  1896,  arising  from  a  discussion  upon  a  paper  by  A.  R. 
Wallace. 

X  'Roy.  Soc.  Proc.,'  vol.  60,  p.  477. 
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In  particular,  if  a  selected  group  be  not  given  with  very  great  accuracy  by  a 
normal  frequency  surface,  still  we  may,  I  think,  consider  ourselves  justified  in 
supposing  that  the  effects  of  the  actual  selection,  and  those  of  a  normal  selection 
Math  the  same  meajis,  the  same  amounts  of  variation,  and  with  correlations  of  the 
same  intensity  will  be  at  least  qualitatively  alike  in  character,  if  they  be  not  indeed 
exactly  the  same  quantitatively  within  the  limits  fixed  by  the  probable  errors  of  the 
constants. 

My  plan  in  this  memoir  will  be  as  follows  : — I  shall  first  state  the  fundamental 
theorem  in  multiple  correlation  with  a  new  proof,  so  that  the  formulae  required  may 
be  once  for  all  collected  for  reference.*  I  shall  then  give  the  algebraic  investigation 
of  the  new  formulse  for  selection.  I  shall  afterwards  consider  sjaecial  simple  cases, 
and  illustrate  them  by  examples.  Finally  I  shall  draw  attention  to  the  nature  of 
the  selective  death-rate  as  indicated  in  cases  of  this  kind,  and  consider  at  length  its 
algebraic  theory.  Throughout  I  shall  endeavour  to  illustrate  the  somewhat  complex 
algebra  by  arithmetical  examples. 


(2.)  On  the  Fundamental  Theorem  in  Multiple  Correlation. 

I  have  shown  in  my  memoir  on  "Regression,  Heredity,  and  Panmixia"  ('Phil. 
Trans.,'  A,  vol.  187,  p.  261)  that  if  the  n  variables  of  a  complex  be  functions  of  m 
(m  >  n)  independent  variables  with  frequency  distributions  following  the  normal  law, 
and  such  that  the  principle  of  superposition  holds  for  the  deviations  from  the  means 
supposed  small  ;  then  the  frequency  of  the  complex  with  deviations  from  the 
means  of  the  n  variables  lying  between  and  Xy  +  Sx^,  Xc^  and  Xc^ -\- hxa  .  .  .  x, 
and  x„  4-  hx,^  will  be  z      hxc^  .  .  .  dx,,  where  z  : 

z  =  Zq  expt.  -  1  (Sj  {c^^pXj,/)  +  2So  {cpjXpX;)  )  (i.). 

Here  Zq,  Cp^,,  c^^  .  .  .  are  constants,  and      denotes  a  summation  for  every  value  of 
and  So  for  every  pair  of  values  of  p  and  q  in  the  series  from  1  to  n. 

In  the  same  memoir  (p.  302)  I  have  determined  the  values  of  z^,  c,,^,  Cp,  in  terms  of 
the  correlations  rp,^  and  the  standard  deviations  a-p  a] id  cr^  of  the  n  variables.  This 
point  had  already  been  considered  by  Professor  Edgeworth  ('  Phil.  Mag.,'  vol.  34, 
p.  201,  1892),  and  some  further  results  by  Mr.  A.  Black,  reached  before  his  death  in 
1893,  were  published  in  the  '  Camb.  Phil.  Trans.'  (vol.  16,  p.  219,  1897).  The 
present  investigation  is,  I  think,  novel,  and  adds  to  results  already  reached  others 
required  in  the  present  memoir,  so  tha.t  it  thus  places  together  with  a  fairly  simple 
proof  all  the  fundamental  results  to  which  I  shall  have  occasion  to  appeal  later. 

*  We  have  used  these  formulae  for  several  years,  but  they  do  not  appear  to  have  been  hitherto  published 
in  a  collected  form. 
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Let  us  consider  the  quadric  of  the  w*''  order 

Q  =  Si  (%..v)  +  2S2  {cj,^Xj,x,')  =  constant, 

and  fix  our  attention  on  two  of  the  variables,  say  the  first  two,  and  x^.  If  these 
be  considered  constants,  the  quadric  of  the  remaining  n  —  2  variables  will  not  now 
be  referred  to  its  "  centre."  But  its  centre,  .^3,  x^  .  .  .  x,„  wnll  be  given  by  the 
equations  : 

Cj3.ri  +  Co^x^  +  C;-iX^  +  C43X4  +    .    .    .     +  c„3.-r«  =  0. 

C14.X1  -j-  C2iX-z  "1"  C-nX-i  -\-  -|-      .      .      .      -|-  C„iX,^  =  0. 


The  following  expressions  will  not  be  zero,  but  will  be  written  a  and  y8 
^11^1  ~l~  CoiX-,  -J-  C31X3  -f"  C41X4  C,aX„  —  CL. 

Cjoti'i  ~r  Ci'ipCi  "i"  C32.T3  ■\-  C42.T4  -f-    •    ,    .    -j-  Cii'>Xii  =  /3. 
Now,  if  A  be  the  discriminant, 


(iii.). 


^13} 

C22i 

Cos,       .  • 

^211 

C33; 

C«2) 

C/,3, 

and  C^^j  the  minor  corresponding  to  c^fj,  we  have  by  solving  the  above  n  linear 
equations, 


A 


whence  : 

Generally  also 


a  = 


x„ 


^Ci,  +  /3C2, 


/3  = 


CnC'22  ~  ^13" 


(v.). 


  (^12C'22  —  C'3}C]2)"^'l       (^11^22  —  ^1(Z^12)"*^2 

P    P  P  2 

=  -Di^.Ti  +  D2jaj.,,    say,  (vi.). 
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This  determines  the  central  co-ordinate  for  any  variable  x,j  for  a  given  value  of 
and  x,^. 

Now  let  us  transfer  the  quadric  to  x^,  x^  .  .  .  x„  as  origin.    It  may  be  written 


+ 


+  ^2 


+  C,,jX,) 

-p  CjiqXif  ^ 
— j-  CjiqX  jj 

~\"   C/ii-^'  a, 


-\-  C112X.H  ^ 

-\-  C112X  „ 


+  Cs-2X3-\- 

Making  use  of  the  n  linear  equations  (ii.)  and  (iii.)  : 

-f-  ajj^  (a  -f"         3  4"  ^4^1^  4  "h 

~1~  '^"i  ^32"^  3  "i"  ^43'^  4  "I" 

+  ctx^  +  /3a-o  (vii.)- 

For  arranging  vertical  columns  in  rows,  the  remaining  terms  are 


-f-  C;,2"^  „) 
-j-  CiiijX 


each  line  of  which  vanishes  by  the  equations  (ii.)  for  the  centre. 
Accordingly  : 

Q  =  Q'  +  aa-^  +  lix.^, 

M^rere  Q'  is  a  quadratic  function  of  x\,  x\,  .  .  .  x\„  not  involving  cc,  and  %  at  all. 
Hence:  2  =  2^ e" +  + 

Now  integrate  z  with  respect  to  all  the  variables  cc'g,  x\,  .  .  .  x' ^  from  —  00  to  +  00  , 
keeping  x^  and  %  constant. 

Then,  although  the  origin  is  a  function  of  x^  and  Xg, 


...      z^e'^'^'  dx'.Ax\  . . .  dx,i 

J  — on    J  — Xi      J  —  Cfj 


cannot  involve  x^  and  x^  but  only  c^q,  c^q . .  .  Cnq,  &c.  ;  let  the  result  be  ^q.    Then  : 
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must  be  the  correlation  surface  for  and  a?2>  foi"  values  of  x^,  x^,  . . .  x„.  We  may 
write  it  in  the  form 

z  =  i^  exjDt.  —  f  A  p  p   _n  2   (vni.). 

11    22  13 

Comparing  this  with  the  known  form  of  the  correlation  surface  for  two  variables,* 

27ro-iO-o\/l  —  ^  ^  1  —        \cri~         0-10-3         o-.-/  ^  ^ 

we  have  at  once 

err  (1  -  n/) = ^         ^2  (1  -  ^  ic )  - — M^r~' 

Ty^cr^a-c,  =  tJ^,/^   (x.). 

Whence                                 ■''12  =  CJjo/v^Ci^Cgg   (xi-)) 

cri2  =  Cn/A,        o-,^  =  ao/A   (xii.). 

Or,  generally  : 

t'pq  =  GjxjV^ppGqq   (xiii.), 

a  J,  —  a/C^^/A  (xiv.). 

Thus  correlations  and  variations  are  fully  determined  in  terms  of  the  discriminant 
and  its  minor  for  the  constants  c^-^,  Cog,  c^o .  . .  Cpp,  Cjrp  Cj,rp  •  •  •  C;„,. 

We  have  next  the  inverse  proposition  to  find  the  r's  in  terms  of  the  r's  and  cr's. 
We  have,  by  well-known  propositions  in  the  theory  of  determinants  : 

fiiCii  +  CioCio  +  C13C13  +    .    .    .    +  Ci,,C„  =  A. 

'^n^n  +  e^isCo..  +  C13C23  +  .  .  .  +  Cy,Co„,  =  0. 
CnCsi  +  C12C32  +  C13C33  +     .     .     .      +  Ci,C.,,,  =  0. 


CiiC;a  -4-  Ci2C,,2  +  C13C,,.,  +     .     .  .  +  Ci„C,,„  =  0. 

CllO"l'"  +  Ci2?-i2(7-iO-2  +  fi3^*13CriO-3  +  .  .  .    +  Ci,,ri,,(TyO;,  =  1, 

Cn''«iO'iO-2  +  Clocr.-  +  CigrosCTscra  +  .  .  .  +  Ci,,r2>!0--.a;,  =  0, 

Ci  1^310-10-3  +  CioVsoO-.O-s  -f  C13O-3"  +  .  .  .  +  c,„7-3„o-30-,,  =  0. 


Ciir,„o-iO-;,  -f  Ci2r;,20-20-„  +  Cisr^go-aO-;,  +    .    .    +  Ci«o-„2  =  0. 

Hence  solving 

Cu  =  Sii/S,       Ciy  =  Si^/S  (xv.), 

*  Pearson,  'Phil.  Trans.,'  A,  vol.  187,  p.  264. 
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where  S  = 


But 


where  R  is  the  determinant, 


or, 

(Ts,       •     •     •  ^'suO-'.^a-,, 

corresponding  to  the  constituent  rj„j(rj,a^ . 

S 

o      '1  9 

.  .  .  (r;,^R, 

0       il  n 

.  .  .  cr^,cr,^  .  .  .  (r,t~'Rj,q, 

1, 

• 

1, 

5 '23,  • 

5  31, 

1,  . 

.  1 

(xvi.), 


and  R^„j  is  the  minor  corresponding  to  the  constituent  r^,,^.    Thus  we  have 


^11  =  ~o  Rii/^^,        <^vi  =  ^  Rjr/./R 


(xvii.), 


or,  generally, 

Thus  z  may  be  written'"' 


^Vy^  =      if'  =  ^~  '^pJ^  (xviii.). 


a-  ./  E 


.  =  .„expt.-i{s.(!??^)  +  2S,('|^^)}     .    .    .  (xix.). 
It  remains  to  determine  Zq  from  the  fact  that  the  volume  of  the  surface  =  N. 

r+T.     r+x  r  +  x 

Or,  N  =     I  •  ■  •       6xpt.  (—  ^Q)  dx^  dxo .  .  .  dx,, 

=  z,o;a,  .  .  .  cr.  r  [\  .  .        expt.  -  1  {s,  (^..;^)  +  2S,  o^^j} 

r/x\  dx\  .  .  .  dx'„   (xx.), 

by  writing  Xplcr^,  =  x'j„  &c. 

Now,  integrate  first  with  regard  to  x\  writing  first 


*  '  Phil.  Trans.,'  A,  vol  187,  p.  302. 
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where  is  the  sum  for  all  values  of and  So  for  all  pairs  of  values  of  j9  aud  q,  from 
2  to  «  inclusive, 

=  1^  {x\  +  H',)^  +  +  2^2  (if  where  H\  =     (|-  ^'.j  . 

For  if  =  Rj],  or  the  determinant  of  the  correlation  coefficients,  omitting  all 
involving  the  first  variable,  i.e.,  the  first  row  and  column  of  R,  the  determinant  li'^^ 
corresponding  to  the  minor  of  the  constituent  Vp^  in  R'  or  to  the  second  minor  (.  Rn)^,^ 
of  R  is  given  by 

R;,  =  (R^,Rji  -  Ri^Rj,)/R  (xxi.), 

and  for  j)  ■= 

R;,.  =  (R,,,Rn  -  Ri/)/R  (xxii.). 

Hence,  integrating  x\  between  the  limits  +  oo  and  —  oo,  we  have 

N  =  2oO-iO"e .  .  .  o-„ .  v/27r  ^J ^/ 

X       f '^  .  .  f expt.  -  1       (^;^  a;^)  +  r«>',)|  dx' ,  dx', .  .  .  dx',,. 

This  is  of  precisely  the  same  form  as  before,  except  that  we  have  the  factor 
\/ 2-77  \/R/R',  and  the  multiple  integral  is  reduced  by  one  integration  and  by  the 
disappearance  of  all  correlations  involving  the  first  variable.  Now,  integrate  with 
regard  to  x\y  The  sole  effect  will  be  to  multiply  by  a  factor  \/ 2^  \/Ji'/R",  where 
R"  is  the  minor  of  R'  not  involving  correlations  of  the  second  variable.  Thus,  by 
repeating  the  process,  we  have  ultimately 

N  =  z,o;a-, .  .  .  (7„  {^2n)"  ^/n/W  ^/W|W  yRTR^  •  •  •  \/ ^  ' 

 _N  ,  .... 

or,  Zr.  =  -—-  7—  (xxni.  I, 

which  gives  the  constant  of  the  surface. 

The  preceding  investigation  enables  us  also  to  deal  with  two  further  points. 

(a)  Given  n  variables,  what  is  the  mean  value  ni^  +  x-^  of  the  first  variable  and 

its  variability  cr\  for  definite  values  m.,  +  h..,  +  A.  .  .  .  +  h,,  of  the  other 
{n  —  1)  variables  ? 
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Clearly,  when  x'.;pc'.^  .  .  .  x'„  are  constants,  the  distribution  of  a/^  is  of  the  form 

constant  X  expt.  —  |-  "^^.^        +     (^'' ■ 
Or,  re-introducing  the  cr^,  a.,  .  .  .  cr,„  we  have  a  distribution  about  the  point  given  by 


X,  =  —  O^i 


-^12  '''3  I  5]?    _i_     _i_  '^'A 


>  .    .    .    .  (xxiv.). 


with  standard  deviation  cr\  =  cr^  \ 

(b)  Given  n  variables,  what  are  the  mean  values  +  x^,  +  Xo,  the  standard 
deviations  a-'\,  a-" a,  and  the  correlation  r^..  of  two  of  them,  when  we  give  definite 
values  mg  +  /ig,  to,.  +      .  .  .  .       +  A„,  to  the  remaining  {n  —  2)  variables  ? 

In  this  case  we  have  from  (i.) 

z  =  expt.  —  |-  {cn^i~  +  2<?io.'ri»-.,  +  CooO;/ 

+  2  (c'lg/ig  +  Cj^?,^  -h  .  .  .  -I-  cji„)xi 

+  2  (CogAg  +  G-lA      +   .   •   •    +  C.ah.,)  X, 

+  terms  not  involving  x^  and  X2}  .    .    .    .  (xxv.). 
Writing      for  the  coefficient  of  Xj,  and  Ko  for  that  of  cc.,  we  have  for  the  centre 

-  _      (K^Cog  —  KpCjg)  -  _  _  (  —  K^c^3  + 

•^1  —  ,  ,2'  "'-'S  —  ^„  ^2' 

^11^' '>2         12  11  22  12 

  y    /^]p%3        '^2i'^13  I    \    / ^2P^1  ~  ^IP^\2  ]  \ 

X^    -^l  I  2  "/>  )  5  "^2  —         •^l  1  3         /  ' 

\  ^11^22        ^12"        /  \  '^ll''22        ^12"  / 

-    ^  ^  /E;^R33  —  RgpRja  /lA         -     ^  ^  /RopE^  —  Rij)Ri2  ^\ 

^  ^        \.   ■K'11-^22  ~  -'^IS^  ^        V   -^"^ll-f^22        -^12"      ^pJ  ' 

by  transferring  to  the  minors  of  R  and  the  cr's.  Or, 

Here  R"  is  the  determinant  formed  by  striking  out  the  first  two  rows  and 
columns  of  E.  ;  p'\p  is  the  minor  obtained  by  striking  out  the  second  row  and  column 
from  R,  and  then  the  first  row  and  p*''  column ;  p'.2p  the  minor  obtained  by  striking 
out  the  first  row  and  column,  and  then  the  second  row  and  p*''  column.    But  a 

comparison  with  (xxiv.)  shows  us  that  these  values  for  a-^  and  x^  are  precisely  what 
we  should  have  obtained  for  the  regression  equations  of  the  1st  and  2nd  variables 
respectively  a/one  on  the  other  u  —  2  variables.     Thus  the  existence  and  the 

correlations  of  Xc,  have  no  effect  on  -the  value  of  x^,  nor  those  of     on  the  value  of  x^- 
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lletiuniiig  to  (xxv.),  we  remark  that  the  terms  of  the  seooiul  order  in  ,t,  and  .v.,  on 
which  the  correlation  and  variations  dejjend,  are  not  altered  by  a  transfer  to  the 

centre      and  x.,  of  the  array. 

Hence  by  (ix.)  and  (xvii.)  we  have 


—  =  ~  K,o/  V' RjiR'e  (xxvil,). 

This  is  the  partial  correlation  of  the  Lst  and  2nd  organs  for  the  remaining  rt  —  2 
organs  with  constant  values. 
Again, 


'      ^22  —  -//  3 


— "  3/1  =;  \ 


Whence  we  easily  find  from  (xviii.)  and  (xxvii.) 


//  3          „  -2 


RE, 


1  ~    1  R  R    —  R  3 

n    23  12 


>     cr  0  =  cr. 


R  R, 


or, 


o-"i  =  o-j  v/R^R"'     ^"-i  =      v/lii,/R"    ....  (xxviii.), 

where  Ji"  is,  as  before,  the  determinant  K  without  its  first  two  rows  and  columns. 
These  by  (xxiv,),  are  what  we  should  have  reached  by  ignoring  x^,  in  finding  (t\,  and 

ajj  in  finding  o-'^. 

(3.)  (jreueral  Theorem  in  Selection. 

To  find  the  selected  means,  the  selected  oari  itions  and  selected  correlations,  ivhen 
q  organs  are  selected,  natui'olly  or  artificially,  out  of  a  complex  of  n  organs. 

Let  the  selected  group  of  q  organs  have  their  means  raised  h^,  h.,,  h.^,  .  .  .  h,^  (some 
of  these  quantities  may  be  negative)  ;  their  standard  deviations  changed  from 
a-],  0-3,  .  .  .  cr^  to  s^,  S.2,  s.^,  .  .  .  s,j,  and  their  mutual  correlations  from  rjo,  Ty^,  .  .  .  r^^, 

^'33'  '''s-iJ  •  •  •  '''2?'  •  •  •  ''V-l//        PlZ'  Pl3'  •  •  •  Pi'/'  Pi'.i'  P-2l>  •  •  •  P-2'n  ■  •  •  Pi-l.r 

The  whole  system  of  n  organs  before  selection  will  be  defined  by  the  means  as 
origin  of  measurement  for  each  organ,  by  the  standard  deviations  o-j,  cr.,,  0-3,  ..  .  cr„, 
and  by  the  coefficients  of  correlation  r^,,  ri..,,  .  .  .  ri„,  r...,,,  r.,^,  .  .  .  r.,,„  .  .  .  Let 
R.  be  the  determinant 


v,.„ 


«3> 


and  R„„  the  minor  corresponding  to  the  constituent  r,,,.  Then  the  unselected 
population  is  given  by  the  frequency  surface  of  equation  (xix.), 
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.  =  constant  X  expt.  -  i  {s.        Q  +  2S',  (?f  |^)}  .    .  (xxix.). 

For  brevity,  we  can  also  write  this  in  the  form 

z  =  constant  X  expt.  -  ^  (S^  {Cp,Xp^)  +  28,  (c^^^XpX,)}    .    .  {xx'ix.fK 

Now  consider  for  the  time  only  q  +  I  organs— namely,  the  first  q  organs  and  the 
in}^  organ  (m  >  q),  and  let  us  write  R  (w),  if  u  be  >  q,  for  the  determinant  : 


R  (u)  = 


1, 

'■l3,  • 

1, 

1,  • 

•  1, 

T 

r.,„ 


(xxx.). 


Then  if  R  {u)p,p„  be  the  minor  corresponding  to  the  constituent  r^.p,,,  and  if 


frequency  distribution 


the  distribution  of  the  g  +  1  organs  will  be  given  by  the 
constant  X  expt.  —  |-  {8>^{hp,^,,x/)  +  '28.2{bp,p„Xp,x^y)\  .    .  (xxxi.). 


being  a  sum  for  every  value  of  p'  throughout  the  2  +  1  organs,  and  for  every 
pair  of  values. 

Now  let  the  first  q  organs  be  given  values  h^,  h.,,  .  .  .  //^,  then  the  mean  value  of  x„ 
will  be  given  by 


^  K + ^'  A, + . . . + ^  k 


1  /  ' 


(xxxii.). 


Now  these  coefficients  ca,n  be  found  at  once  if  q  be  known. 
For  example  : 


E(«)»„        1  -  r,,'  '  R(?0™  "    1  -  '•rr 


=  3, 


and 


R  (?0"«  1  —  ^'n  —  ■?*23'  —  "'\s'  +  2ri,?%3ri.3 

R  ,       R  (m),5„  1-11 

—  w^rr~  and  —  ^ry^,'  can  be  written  down  by  symmetry 

c  2 


(xxxiii.). 
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Now  suppose  in  the  expression  (xxix.)''"  we  were  to  put  x^,  .Xo,  .  .  .  x,j  equal  to 
h-^,  7^2,  .  ,  .       respectively,  then  the  system  of  equations  to  find  tlie  means  of 
^g+z^  •  •  •  ^i+v-',  •  .  .  a-,,  for  this  array  M'ould  be  the     —  q  equations 


Cv,  Ih  +  Cou  /<2  +  •  •  •  +  Cg.x  ^hj  +      +  1+  C.y  +  2,«  •'<"'?+2+  •  •  •  +  ^/o,  ^'t'',/  =  0  .  (xxxlv.), 

where  aj'^+j,  x'^+o,  .  .  .  x'-,^  are  the  "co-ordinates  of  the  centre"  of  the  array  of  the 
71  —-  q  organs. 

If  we  were  to  solve  these  equations  we  ought  to  get  precisely  the  solution  for 
x'„  {u  >  q  Kn  -\-  1)  that  we  have  found  for  x',,  in  (xxxii.)  above,  where  none  of  the 
coefficients  involve  correlation-coefficients  other  than  those  of  tlie  first  q  organs 
among  themselves  and  with  the  w***  organ.  This  result  flows  from  the  pretty  obvious 
law  that  the  mean  of  the  organ  for  an  array  determined  by  values  of  the 
first  q  organs  cannot  be  in  any  way  dependent  on  our  considering  the  relation  of  this 
selection  of  q  organs  to  any  additional  organs  beside  the      :  see  p.  9. 

Thus  the  solution  of  (xxxiv.)  is  simply  obtained  by  putting  ii  =  q  -\-  1,  q  -\-  2, 
q  -\-  S,  .  .  .  n  successively  in  (xxxiii.). 

Let  us  now  select  the  first  q  organs  not  with  absolute  values,  but  varying  about 
means  h^,  h.^,  .  .  .  hj,  with  standard  deviations  s^,  -So  .  .  .  s^,  and  with  mutual  corre- 
lations /)jg,  .  .  .  Ply,  /3o3,  .  .  .  p.yp  .  .  .  pq-\q-  We  have  then  to  multiply  {  in  (xxix.)'"'* 
by  an  exponential  quadratic  function  of  the  x^-\-k^,  ^'■z'^^'i^  •  ■  •  '>'q-\-^hj,  ^-c,  the  selective 
correlation  surface,  and  divide  it  by  another  exponential  quadratic  surface,  i.e.,  the 
primary  coiTelation  surface  of  the  q  organs  x^,  x^  .  .  .  x,^.  This  follows,  since  the 
frequency  of  each  complex  of  n  organs  must  be  reduced  in  the  ratio  of  the  selected 
to  the  primary  frequency  of  the  complex  of  q  selected  organs.  But  it  will  be  clear 
that  such  a  reduction  must  give  us  a  result  of  the  following  form  for  the  final 
frequency  surface  of  the  n  organs  : 

Z  =  constant  X  expt.  —  \  jc^pXi"  -f  ^ccg^  +  .  .  .  +  c,^,^x/ 

+  2c,,,„x,,x,„  -\-  .  .  .  {y'  and  v"  <C  q  I) 
+  2c,„v,x„  -\-  .  .  .  {o  <  q -\-  I  and  ^^  >  q) 
-4-  '2Ci,.„.x-.,x,r'  +  .  .  .  [vf  and  u"  >  q) 

-f-  linear  terms  in  av,  ^i"  (v'  and  v"  <  7  +  0    •  (xxxv.), 
where  the  c's  denote  the  changed  c's. 
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Now  if  we  differentiate  the  quadric  to  find  its  "  centre  "  we  have  n  equations  in 
.  .  .  Xj  .  .  .  x,„  but  the  solutions  of  these,  if  Xj+^  .  .  .  x„  were  eliminated,  are  known 
to  be  the  "  centre  "  h^,  h.  .  .  .  hq.    Hence  we  require  only  n  —  q  equations  involving 


x,,  and  we  can  put  the  h's  for  the  remaining  values  x^  .  .  .  Xg.    Let  us  take 
the  differentials  of  the    quadric  with  regard  to  x,^+^  .  .  .  x,„  then  the  resulting 


'•7+1 


equations  involve  none  of  the  ?s,  but  only  the  c's.  They  reproduce  in  fact  (xxxiv.). 
But  the  values  of  x\^+^  .  .  .  x'„  found  from  (xxxiv.)  are,  we  have  seen,  identical 
with  the  values  of  (xxxii.).    Thus  we  have 


E(?+l) 


+ 


R(?+i)?+i,«+i 


R(^  +  2)i 

g+2     °"g+2  J-^ 

R(?  +  2)r/+2.2+l  O-l 


1^(2  +  2)g+2,^+2  ^J' 


E(«)„„  o-,/  ^ 


(xxxvi, ). 


These  give  the  most  general  form  of  a  theorem  proved  for  a  particular  case  in 
'  Phil.  Trans.,'  A,  vol.  187,  p.  300,  c  (ii.).  If  systems  of  q  organs  he  selected  ivith  any 
arbitrary  variations  and  correlations  out  of  complexes  of  n  organs,  then  the  mean  sizes 
of  the  remaining  n  —  q  organs  have  precisely  the  same  values  as  if  the  selection  of  all 
the  systems  of  q  organs  liad  been  to  one  size  and  not  varied  about  mean  values.  The 
arbitrary  variatio7is  of  the  selected  systems  about  these  mean  values,  as  ivell  as  the 
arbitrary  correlations,  have  "no  influence  on  the  mean  changes  of  the  n  —  q  organs. 
Returning  to  equation  (xxxv.)  we  know  that  if  the  determinant 


A  = 


^'],2' 

^1,3 

^1,-7' 

^1,!?+1 

^2, 1' 

^'2, 2' 

^2,3 

^'2,(7+1 

Cq,  3 

Oqq, 

Gqq+i 

'       Cq,  n 

<^!?  +  ],l' 

^■7  +  1,2' 

^y+l.B 

^?  +  ],?' 

^1  +  1,9+1 

•     ^q  +  \,n 

-'11,1^ 


■'",1+1 


(xxxvii  ) 


be  formed,  its  constituents  and  not  the  hnear  terms  in  the  exponential  of  (xxxv.) 
determine  all  the  standard  deviations  and  correlations.  Let  2„  be  the  variation 
after  selection  of  the  w'^  organ  ;  then  if  u  be  one  of  the  selected  organs  %,i  =  5,,,  if  u 
be  for  one  of  the  unselected  organs  %„  has  still  to  be  found.    Let  x„v  be  the  correlation 
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coefficient  of  the  and  v'^  organs  after  selection,  then  if  u  and  v  be  selected  organs, 
Vu.  =  p>,r ;  if  they  be  not  both  selected  organs,  then  it  has  to  be  found.  Let  D„„  be 
the  minor  corresponding  to  the  constituent  c„„  in  the  above  determinant,  then  by 
(x  )  and  (xiv.) 

%r  =  ^  (xxxvni.), 

and 

=  ^  (xxxix.). 

Now  let  us  write  for  brevity  a,,,,  =  I)„„/A  and  a,„,  =  D„„/A.     Clearly  as  long  as  u  and  v 
are  less  than     +  1,       and  x„,  will  both  be  known,  i.e.,  are  equal  to  s,r  and  .s„s,.p„,. 
Now  by  a  well-known  property  of  determinants 

^Ly+l**!!  H"  '^'■Z,i+\^V2  "1~  ^Xt  +  l'^lS  +•••"{"  ^q,q+\^l'J  ~\~  ^'/+ 1,9  +  1^1,'/  + 1.  ~l~  ■  •  •   ~f~  <•'//,  +  =  0 


Ci,.,a„  +  C.^n^i-i  +  <-'.v/'5'i;i  +  .  .  .  +  Cy;,a,,/  +  c'y+j,„ai^,^+3  +  .  .  .  +  c^a^,  =  0    .     .  (xl.). 

Comparing  these  equations  for  the  n  —  q  unknowns  o^^,^_^_^,  <''i'7+3  •  •  •  with  (xxxiv.) 
for  finding  x'^^^,  ^^'f/+-2  •  •  •  ^'p,  we  see  that  they  are  absolutely  identical  if  we  change 
A],  h.^  .  .  .  Iiq  in  the  latter  into  a^^,  ajo  .  .  .  a^,^.  Accordingly  the  solution  is  given  by 
(xxxvi.),  or  we  have 


R(5  +  lVi,,+i  R(^  +  l) 


_  _  j   I^(g  +  ^)l.?+2  q"y+3  I       R(^+2)2  <7y+2  _|_  1     J^_(£+^)m±3_  ^+2 


_     r  R(7?.)i.„  o-„        R0i)2.«         1        I  o-''  1 

Provided  r  be  <  g  +  1  we  might  equally  well  have  used  a,i,  a,.,,  .  .  .  a.,^.  Hence 
we  conclude  that  if  u  be  <  ^  +  1  and  u  be  >  q, 

Or,  substituting  the  known  values  of  a„■^,  a.,^,  .  .  .  a,.^,  we  have  for  v  <  +  1  and 
u>  q 
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The  next  stage  in  our  work  is  to  find  a,„  when  both  y  and  u  are  >  r/,  and  also  a„„ 
when  y  =  and  both  are  >  q.  This  is  done  at  once  by  substituting  the  minors  a„■^, 
a,,.,,  .  .  .  a,,,/  .  .  .  in  the  equations  formed  from  the  last  2> — <I  lines  of  the  determinant 
(xxxvii.). 

We  obtain  the  following  system  : 


These  are  identical  with  equations  (xl.),  except  that  the  equation  with  the 
coefficients  Cy,,,  c.-,_,„  .  .  .  c„„  has  unity  instead  of  zero  on  the  right-hand  side.  Hence 
we  see  that  a,,,,  (i?  >  q)  will  be  the  same  function  of  a„i,  a„2,  •  •  •  that  x'„  is  of  li^, 
Jh,  .  .  .  in  equation  (xxxvi.),  but  it  will  add  to  this  a  function  of  the  last  p — (/ 
system  of  c's,  i.e.,  the  c's 


Cq  HI       ^■qj^\^n'i         •       •        •       C,,  ,,. 

Whatever  this  function  may  be  we  will  represent  it  for  the  time  by  y,„, ;  we  notice 
that  it  is  independent  of  the  selected  variations  s^,  6'.,,  .  .  .  s^,  the  selected  means  h^, 
ho,  .  .  .  hq,  and  the  selected  correlation  coefficients  p^o,  /O03,  .  .  .  ^.  It  depends 
only  on  the  characters  before  selection. 

We  thus  have 

[R(;;)i,  a,  R(/;)o,  a,.  R  ] 

[  R  ( V  ),„  o-j  R  ( V ),,.  o-j  K  {v \ 

Now  the  system  «,„,  rz„_,,  .  .  .  a,,^  can  be  f<jund  from  (xli.),  since  =  «„„„  whatever 
w'  and  u"  be. 
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Hence  we  have 


Si 


R(4„,  R(->4„/^A^ 


R       R(i«)<m  \<^v 


Here  Sj  denotes  a  sum  from  _p  =  1  to  p  =     and  S.  a  snm  for  every  pair  of  values 
of  'p  and  ^"  out  of  1,  2,  8,  .  .  .  q. 
When  M  =  v  we  have  simply 


R(vVA2/.s 


It  only  remains  to  determine  y,„  and  y,,.  This  we  can  do  by  putting  all  the  s's 
zero,  or  selecting  our  q  organs  of  one  size  only.  We  see  at  once  that  y,,,,  and  y,,  are 
the  values  of  a„„  and  a,,,  that  is,  of  S„t,r,vo  and  2,",  when  we  select  q  organs  of 
definite  values  and  seek  the  correlation  and  the  variabilities  of  two  others,  the  m*''  and 
the         These  values  have  already  been  found  on  p.  10.    Or  : 


%?  =  y?=:o;m{uv)„„/^{q\ 
=  y,r  —  cr„^R  [q), 


(xlii.). 


The  notation  of  that  page  has  been  changed  so  that  R(t«j)  now  stands  for  the 
determinant 


^  1        '''l2)  '"iS) 


.  (xhii.). 


'R{uv)„„  is  the  minor  corresponding  to  the  constituent  ;  ^{uv)um  the  minor 
corresponding  to  the  constituent  at  the  meet  of  the  u*''  column  and  tt"'  row  ;  and 
H  {c[)  the  determinant  with  the  last  two  rows  and  two  columns  struck  out. 

For  example  : 


—  9 
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Jr. 

y,n, 
y„r 

y,-r 


=  0-/(1 


=  cr 


0-/(1  -  r,/  ~        -       +  2r,,7V,7-,„)/(l  -  r,/), 


=  o-,,"-  X 


yv„  ~  o-,/^  X 


cr,a-„ 


(1  - 

—  ''l 

I, 

1, 

J'2:!, 

n>- 

'V2, 

1, 

^^■:), 

1 

1, 

'"12, 

^■•21, 

1, 

'>':'.2, 

1, 

^ /'2> 

1 

1, 

^'13, 

^•]. 

''"21, 

1, 

1, 

r^r 

(xliv.). 


W e  can  now  collect  our  complete  results. 

The  variability  of  a  non-selected  organ  y  >  is  after  the  selection  of  q  organs 
given  by 


2„-  =  ar,r  i  ^,777"  +  ^1 


E(./) 


+  2S. 


I 


.  (xlv.) 


The  correlation  of  two  non-selected  organs  v  and  u  both  >  ^  is  after  the  selection 
of  q  organs  given  by 


-4-  S  ^  — 
11  (rj)  1  lR(r),„ROO™\<^./ 


~  1  \  R R  R (7;),.,.  R  (n)„„  /        c7,<T,.,  J  _ 


.  (xlvi.). 


The  correlation  between  a  non-selected  organ  ii.  >  q  and  a  selected  organ  v  < '/  -j-  I 
is  given  by  (xli.)*"'  or, 


,'R  {ii.)j,„, 


(xlvii.). 


*  The  expanded  values  of  these  determiiiaiits  are  given,  '  Phil.  Trans.,'  A,  vol.  LS7,  p.  294, 
VOL.    CC,  A,  p 
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Here  in  :  2)  takes  eveiy  value  from  1  to  q,  and  in  So  :  j)'  and  'p'  every  possible  pair 
of  values  from  1  to  q.  Equations  (xlv.  )-(xlvii.)  fully  determine  all  the  required 
quantities  and  form  the  full  solution  of  the  pi^oblem  of  selection.  Before  we  see  the 
remarkably  simple  forms  they  take  for  simpler  cases,  we  may  draw  some  general 
conclusions  of  a  most  important  character. 

In  the  first  place  we  must  distinguish  between  directly  selected  and  what  we 
have  termed  non-selected  organs.    It  would  be  better  to  term  the  latter  indirecthj 
selected  organs.    Suppose  the  recruiting  sergeant  were  to  pay  attention  only  to 
stature  and  seek  to  form  a  regiment  of  men  of  about  5  feet  10  inches.    He  might 
have  a  real  range  of  stature  about  6  or  7  inches,  but  he  would  strive  to  get  men 
of  about  this  height  from  the  population.     We  will  suppose  that  he  did  not  consider 
chest-breadth,  head-length,  foot-length,  lungs  or  any  other  character.    The  distribu- 
tion of  these  "  non-selected  "  characters  in  the  i^egiment  would  not  be  the  same  as  in 
the  general  population.    Their  means  would  have  changed  by  (xxxvi.)  and  their 
variabilities  and  correlations  be  given  by  (xlv.)-{xlvii.).    In  other  words,  an  indirect 
selection  would  have  taken  place.    A  selection  by  stature  would  change  foot-length 
and  head-length  and  indeed  every  other  correlated  organ.    Much  the  same  result  must 
occur  in  natural  selection.    If  it  be  advantageous  for  a  species  to  have  a  certain  group 
of  its  organs  of  definite  size,  falling  within  a  definite  range,  and  related  to  each  other 
in  a  definite  manner,  then  these  changes  cannot  take  place  without  modifying  not  only 
the  size,  but  the  variability  and  correlation  of  all  the  other  organs  correlated  with  these, 
although  these  organs  themselves  be  not  directly  selected.    Practically  this  means 
all  the  other  organs,  for  so  far  one  can  hardly  say  with  certainty  that  we  have  come 
across  any  two  characters  in  an  organism  which  are  uncorrelated.    Many  of  those 
investigated  are  highly  correlated,  all  appear  to  have  some  corfelation,  even  if  it  be 
very  small  or  negative. 

We  may  therefore  conclude  as  follows : — 

(a)  The  selection  of  any  complex  of  characters  or  organs  in  an  organism  changes 
all  the  other  characters  and  organs  not  directly  selected. 

(/>)  If  the  change  in  the  complex  be  continuous  and  progressive,  the  other 
characters  will  continue  to  be  modified  until  the  change  in  them  is  so  considerable 
that  selection  begins  to  act  directly  upon  them  also. 

(c)  The  changes  noted  here  are  not  confined  to  the  average  value  of  a  non- 
directly  selected  character  and  to  its  variability  ;  the  correlations  between  non-directly 
selected  characters  and  the  correlations  between  directly  and  non-directly  selected 
characters  are  also  both  chanp-ed. 

{d.)  If  local  races  have  been  produced  by  selection  from  a  common  stock,  it  will 
be  impossible  to  look  upon  correlation  as  a  criterion  for  species.  Every  selection  will 
modify  such  correlation,  and  it  has  no  greater  fixity  than  either  type  value  (mean) 
or  variability  (standard  deviation). 

The  whole  of  these  statements  will  become  more  manifest  as  we  apply  our  genertd 
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tlieorem  tO' special  cases,  but  we  must  note  that  if  two  organs  were  uiicorrelated  with 
each  other,  it  still  might  be  possible  by  selecting  a  third,  or  a  third  and  fourth,  to 
produce  correlation  betweeji  them.  Further,  by  selection  of  one  or  more  organs,  two 
non-directly  selected  organs  can  have  their  existing  correlation  increased,  lessened  or 
even  changed  in  sign. 

(4.)  A  primary  difficulty  will  of  course  arise  in  the  case  of  natural  selection.  Hom^ 
are  we  to  determine  which  are  the  directly  and  which  are  the  indirectly  selected 
organs  ?  With  artificial  selection  by  man,  we  know  which  organs  have  been  selected 
fairly  well ;  attention  has  been  paid  to  colour,  size,  proportion  of  parts,  &c.  Even  in 
the  case  of  the  medical  examination  of  the  recruit,  it  is  chest,  lungs,  heart,  stature, 
&c.,  which  form  the  basis  of  the  acceptance  or  rejection.  If  the  head  or  foot  be  not 
absolutely  deformed,  little  if  any  attention  is  paid  to  them,  so  with  hair-colour, 
probably  eye-colour,  and  a  mass  of  other  details.  No  doubt  the  direct  medical 
selection  indirectly  selects  these,  but  we  could  roughly  class  the  selected  and  non- 
selected  organs  or  characters  and  investigate  the  changes  in  the  correlations  of  the 
latter  owing  to  the  indirect  selection.  But  how  are  we  to  form  these  classes  in  the 
case  of  natural  selection  ? 

The  investigations  may  look  difficult,  and  even  from  the  standpoint  of  arithmetic 
appalling,  but  it  seems  to  me  that  the  dififerentiation  of  organs  into  directly  and 
indirectly  selected  classes  is  the  keynote  to  the  problem  of  evolution  by  natural 
selection. 

Let  us  look  at  a  simple  case  and  see  whether  it  will  throw  any  light  on  the 
problem  of  distinguishing  between  directly  and  indirectly  selected  organs.  Suppose  we 
have  two  organs  only,  with  means  m^,  m.^,  standard  deviations  cr^,  cto,  correlation  r^.,, 
and  let  the  first  be  selected  so  as  to  have  a  mean  value  m-^  +  h^,  and  standard 
deviation  s^.  Let  So  be  the  standard  deviation  of  the  second  organ  and  the 
correlation  of  the  two  organs  after  selection,  -and  m.^  +  .x'.  the  mean  of  the  non- 
selected  organ. 

Then  by  (xxxii.)  : 

and  it  will  be  shown  later  (see  p.  23)  that 


3  / '  12  j> 


and 

Hence  we  have  : 
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III  other  words  the  regression  coefficient  of  the  non-selected  organ  on  the  selected 
remains  unchanged,  while  that  of  the  selected  organ  on  the  non-selected  will,  as  a 
rule,  be  widely  modified. 

Further,  let  be  the  mean  value  of  tlie  second  organ  before  selection  corresponding 
to  a  value  of  the  first ;  let  and  Mn  be  the  means  of  the  organs  after  selection, 
and  Yo  be  the  mean  value  of  the  second  organ  corresponding  to  a  value  Kj  of  the 
first.    Then  the  equation  to  the  regression  line  before  selection  is 

Xo  =  r.:,  —  H,  -f  III,  —  r,  —  m^, 

and  after  selection  it  is 

Y,  =  r,3  K,  +  M,  -  r,,  ?^  M„ 

=         Kj  +  m...  -f      —  /'i  —      ^  [iii^  -f 

cr,  a. 

But  this  is  identically  the  same  line  as  the  regression  line  before  selection.  Hence 
not  only  the  slope  (regression  coefficient)  of  the  line,  but  its  position  is  identical,  and 
we  have  the  following  result  :  — 

Iftivo  local  races  Jiave  been  evolved  from  a  single  stock  hy  the  selection  in  different 
ways  of  one  organ  only,  then  the  regression  lines  for  the  two  races  of  any  non-directly 
selected  organ  on  the  directly  selected  organ  ivill  he  the  same  in  direction  and  position; 
hut  the  regression  lines  of  the  selected  organ  on  any  non-selected  organ  tvill  differ  for 
the  two  races  * 

Of  course  the  means,  standard  deviations  and  correlations,  not  only  of  the  selected 
organ  but  of  all  the  non-selected  organs  also,  will  jjrobably  have  changed.  It  is  only 
certain  of  the  regression  lines  which  remain  unchanged  and  serve  as  a  criterion  to 
enable  us  to  distinguish  between  directly  and  non-directly  selected  organs. 

Of  course  the  problem  in  Nature  will  not  be  as  simple  as  this,  for  difierentiation  of 
the  two  local  races  may  have  arisen  from  the  selection  of  more  than  one  organ,  or 
may  have  arisen  from  the  selection  of  two  difterent  organs,  but  the  illustration  will, 
I  think,  indicate  the  nature  of  the  investigation  we  are  jjroposing. 

We  can  easily  generalise  our  theorem  by  considering  the  form  of  the  selection 
surface  given  on  p.  12.  Any  result  obtained  from  (xxxv.)  which  does  not  involve  any 
of  the  c's  will  be  a  result  unaffected  by  the  selection  that  has  gone  on.  Now  to 
obtain  a  regression  equation  we  put  any  number  of  the  xs  equal  to  constants,  to  h's 

*  The  geometrical  iiiteqjretutiou  in  this  simple  Ciise  that  the  regression  line  is  unchanged  is  quite 
ol)vious,  and,  indeed,  may  serve  as  a  proof. 
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say,  and  find  the  "  centre  "  of  the  quadric  of  the  remainmg  x's,  the  co-ordinates  of 
this  centre,  expressed  in  terms  of  the  h's,  are  the  regression  equations.  Now  it  will  be 
clear,  that  if  we  put  all  the  selected  xs  equal  to  h's,  the  differentials  of  the  quadric 
with  reo-ard  to  the  remaining  or  non-selected  xs  can  contain  no  c's  or  the  coefficients 
of  the  regression  equations  thus  found  will  not  be  modified  by  selection. 

Further,  we  might  have  given  not  only  the  selected  organs,  but  any  number  of  the 
non-selected  organs  constant  values,  and  the  resulting  regression  equations  would 

involve  only  the  c's  and  not  the  c's. 

Hence  we  have  the  following  general  theorems  :  — 

(i.)  If  an  organ  has  been  modified  only  hy  indirect  selection,  then  its  imrtial 
regression  coefficients  on  any  complex  of  other  organs,  however  large  or  small,  provided 
it  includes  all  the  directly  selected  organs,  ivill  remain  unchanged  hy  the  selection. 

(ii.)  The  same  organ  in  tivo  different  local  races  which  have  been  derived  from  a 
common  stock  by  the  selection  of  tivo  complexes  of  organs,  some  of  ivhich  may  or  may 
not  be  common,  will,  if  it  has  not  been  directly  selected  in  either  case,  give  the  same 
partial  regression  coefficients  for  any  group  of  organs  which  includes  the  members  of 
both  complexes  and.  any  number  of  nan- directly  selected  organs  besides. 

If  the  partial  regression  equations  have  changed  coefficients,  then  we  cannot  at 
once  determine  whether — 

(a)  We  are  dealing  with  a  non-directly  selected  organ,  and  have  not  included  all 
the  directly  selected  organs  in  the  group  upon  which  we  are  calculating  the  regres- 
sion ;  or 

[b]  We  are  really  dealing  with  a  directly  selected  organ.  In  this  case,  we  have 
also  certainly  not  included  at  least  one  directly  selected  organ  in  the  regression 
group. 

Theoretically,  however,  (i.)  and  (ii.)  suffice  to  find  out  which,  if  any,  are  the  non- 
directly  selected  organs  in  the  differentiation  of  local  races.  Practically,  however, 
the  number  of  organs  and  characters  may  be  so  great,  and  our  ignorance  of  those 
probably  selected  so  complete,  that  the  arithmetic  of  determining  so  extensive  a 
series  of  partial  regression  coefficients  may  be  quite  beyond  our  powers.  Still,  where 
the  divergence  between  local  races  is  not  too  great,  and  the  source  of  the  differentia- 
tion not  too  obscure,  it  is  probable  that  the  above  theorems  will  lead  to  results  of 
great  interest.* 

Without  laying  too  great  weight  on  these  theorems,  I  would  still  venture  to 
suggest  that  if  the  criterion  of  a  species  be  the  discovery  of  any  numerical  constant 

*  Mr.  L.  Bramley-Moore  has  been  working  with  this  end  in  view  at  the  long-bones  in  man.  But 
even  here  the  direct  selection  of  parts  of  the  vertebral  column — for  which,  at  present,  we  have  no  correla- 
tion values  either  among  themselves  or  with  the  long-bones — and  of  the  hand  and  foot,  which 
Dr.  W.  R.  Magdonei  t,  has  just  shown,  are  very  highly  correlated  with  the  long-bones,  may  render 
nugatory  all  attempt  to  ascertain  which,  if  any,  long-bone  has  been  only  indirectly,  or,  at  any  rate,  least 
directly  selected. 
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or  group  of  constants,  whicli  is  the  same  for  all  local  races,  then  these  constants  must 
not  be  sought  in  the  values  of  mean  characters,  degrees  of  variability,  or  of  correlation, 
but  in  a  system  of  partial  regression  coefficients,  and  the  discovery  of  these  is  therefore 
of  first  class  biological  importance  ;  it  is  the  classification  of  the  characters  into  directly 
and  non-directly  selected  groups,  i.e.,  it  is  the  discovery  of  the  modus  operandi  of  the 
factors  by  which  the  differentiation  has  taken  place.  We  are  a  long  way  from 
solution  yet,  but  we  may  venture,  perha2)s,  to  admit  a  faint  glimmer  of  light  in  the 
direction  of  what  might  seem  the  culminating  j^roblem  of  the  mathematical  method 
as  applied  to  evolution — the  piecing  together  by  quantitative  analysis  of  the  stages  of 
descent. 

(5.)  I  will  take  now  the  application  of  the  above  results  to  simple  cases  ;  but  for  the 
benefit  of  those  who  cannot  easily  follow  the  main  principles  of  our  investigation 
through  the  stages  of  determinant  analysis,  I  will  prove  directly  the  proposition  that : 
the  selection  of  an  organ  A  alters  the  mean  and  variahility  of  a  correlated  organ  B, 
a.nd  also  tlie  correlation  between  A  and  B, 

Let  the  correlation  surface  for  the  two  organs  be 


N 


1  _  1  /  _  2)'L.i.-e|.<.o  +  X/  \ 
 ^  e    ^"•i-'(l-'V/)      <ri<r,(l-,-,,-^)  <raMl->-u^)/ 


where  N  is  the  number  of  individuals  in  the  general  population  before  selection,  and 
the  subscripts  1  and  2  refer  to  the  organs  A  and  B  respectively. 

liet  the  distribution  of  the  population  after  selection  of  the  A  organ  be 


\/  2-77  S, 


n 


where  N  —  n  is  the  total  destruction,  /i^  the  mean  and  s^  the  variability  of  the 
population  with  regard  to  A  after  selection.    Before  selection  this  distribution  was 

V  27ro-j 

Hence,  the  selection  being  random  with  regard  to  the  array  of  B's  corresponding  to 
any  A,  we  have  for  the  surface  after  selection 

Z  =  2  X  z'jz^, 

for  each  array  must  be  altered  in  the  ratio  of  the  corresponding  z'  to 
This  gives  for  the  surface  in  full 

7  _     »  ,  j    0/1  J  1        \  2i\c^Xyj:.-,  

<^2   (1  —^12)  *1  I 
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Integrate  this  for  eveiy  possible  value  of  and  we  shall  have  the  distribution  h.^  and 
So  of  X..  or  B  after  the  selection  of  A.  After  some  reductions  we  find  for  the  frequency 
CSx, 


n 


1  1  /-,  _'M'-i2<r,Y 


V 

where 


S.2  =  o-.Ml  -(1  -''Vlrj.M  (xlix.), 


which  gives  the  standard  deviation  of  the  indirectly  selected  organ,  and  the  "  centre  " 
of  this  organ  is  given  by 

h.  =  '^h,  (1.). 

These  are  special  cases  of  our  results  (xlv.)  and  (xxxvi.)  above  respectively. 

Further,  returning  to  the  correlation  surface  (xlviii.),  the  coefficient  of  correlation 
I'lo  is  the  coefficient  of  x^x.^  divided  by  the  prodiict  of  the  coefficients  of  x^'  and  a?/. 
Hence  m'b  find 

ri,  =  ^-  -7— rr-    .......  (li.). 

a 


Let      =  sin  0^.,,  x^.,  =  sin  (^^o,  then  we  have 

tan^i^  =  ^'  tan  0,^   (hi-). 

This  shows  us  that  (ji^o  decreases  with  s^,  or  that  Vj.j  decreases  with  s^,  that  is  to  say, 
the  more  intense  the  selection  the  less  is  the  correlation.  This  in  broad  terms 
demonstrates  the  general  principle  that  intensity  of  selection  connotes  a  lessening  of 
correlation.  Tt  is  this  principle  which  very  possibly  accounts/lbr  the  fact  that  the 
more  civilized  races  of  man  appear  to  be  not  only  more  variable  but  more  highly 
correlated  than  the  less  civilized,  among  whom  the  struggle  for  existence  is  more 
intense.  It  may,  perhaps,  also  account  for  the  skeletons  of  women  of  the  civilized 
races  having  their  parts  more  highly  correlated  than  the  parts  of  those  of  men.* 
Lastly,  it  may  w^ell  throw  some  light  on  the  markedly  plastic  character  of  races 
which  have  been  stringently  selected  with  regard  to  one  or  a  few  organs  only. 

As  an  illustration,  suppose  that  the  correlation  between  femur  and  tibia  were  "7, 
and  let  us  investigate  what  would  be  the  effect  of  reducing  the  variability  of  the  tibia 
by  direct  selection  50  per  cent.  We  find  at  once  on  substituting  sja-i  =  "5  and 
r^o  =  '7  in  (li.)  above  that  x^^  =  '44,  or  a  reduction  of  about  37  per  cent.  This  will, 
perhaps,  be  sufficient  to  indicate  what  immense  changes  must  be  made  in  the 
correlation  of  highly  correlated  organs  whenever  selection,  artificial  or  natural,  is 
stringent.  It  is  important  to  notice  that  the  change  in  the  size  of  the  organ  in  no 
*  See  Lee  and  Pearson,  '  Roy.  Soc.  Proc.,'  vol.  61,  p.  .3.54: ;  and  Lee,  'Phil.  Trans.,'  A,  vol.  196,  p.  2.31. 
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way  influences  the  change  in  the  correlation  between  organs,  if  the  distribution  be 
normal,*  the  change  depends  only  on  the  stringency  of  the  selection.  Breeders  who 
select  by  the  size  of  an  organ  only  are  in  that  case  very  likely  to  reduce  the  variability 
of  the  organ  in  the  selected  group  by  far  more  than  50  per  cent.  Accordingly,  it  is 
not  to  be  wondered  at  if  they,  to  a  great  extent,  destroy  the  correlation  between  the 
selected  organ  and  other  organs.  This  destruction  would  appear  as  a  want  of  relation- 
ship between  parts,  possibly  as  a  want  of  fixity  in  type. 

By  means  of  equation  (Hi. )  x^.  can  easily  be  found  from  .  and  p  =  ^i/cr,.  All  we  need 
is  a  table  of  trigonometrical  functions.  We  observe  that  r.o  is  always  of  the  same 
sign  and  less  than  Vio.  For  many  biological  purposes  the  following  graphical  con- 
struction gives  quite  sufiiciently  accurate  results.  Let  CAB  be  a  quadrant,  say  of 
10  centims.  radius,  and  take  the  point  P  on  this  quadrant  distant  PN  =  lO^i^ 

from  AB.    Take  QN  =  -  PN,  and  let  AQ  meet  the  quadrant  in  R,  then  BM  the 

P 

distance  of  R  from  AB  =  lOr^^,,  and  consequently  determines  x^^.  If  the  figure  be 
drawn  on  decimal  paper  the  determination  oft,.-,  is  peculiarly  easy. 

Graphical  methorl  of  finding  correlation  between  organs  A  and  B  after  selection  has  acted  on  A. 

ScaOe  cfr,g  and  R,^  . 


B 

In  the  above  example      =  PN  =  -7 ;  Sijcr  =  -ii  =     and  therefore  QN  =  rjo/»  =  -35. 

P 

Xi-i  =  Ri2  =  RM  =  -44,  as  before.    See  p.  23. 


*  It  will  do  so  if  the  distribution  be  skew,  see  'Phil.  Trans.,'  A,  vol.  191,  p.  231, 
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Further,  taking  AP  as  our  unit,  AN^  =  1  —  r^/,  and  QN^  =  r^/  X  — .  Hence, 
fr-om  (Ixix.)  : 


V  3 


Therefore  if  Acr^  in  the  diagram  be  taken  equal  to  o-^  and  QS^  be  drawn  parallel 
to  Ro-g,  we  shall  have  ASo  =  Sg,  or  we  can  scale  off  the  reduced  variability. 

Thus  the  diagram  enables  us  to  see  at  a  glance  the  reduction  in  correlation  and 
variability. 

(6.)  Let  us  now  write  down  the  results  when  an  organ  A  is  selected  out  of  a  group 
of  three  organs,  A,  B,  C,  whose  constants  are  marked  by  the  subscripts  1,  2,  3, 
respectively.  Let  —  sjai,  and  be  represented,  when  required,  by  cos  Xi-  Then 
we  find  from  (xlv.)  (xlvii.)  : 

 (liii-), 


^  ^^'1,3'^  K  =  ^3  {1  -  sin^  Xi  cos^  ^13}^ 


=  a-,  n  -  1  - 


^13  — 


^13  — 


/il^ij    COS  Xl  C0S^j2 


1^33  — 


{1  -  (1  -  f,,')  x/1  -  Sin3  Xy  COS^  ^12 

 fh^'is   cos  xi  cos 

{1  -  (1  -  _  sin^  cos^"^,; 


1  - 


'  13 


 COS      +  sin^  xi  cos  ^13  cos  0^,^ 

-  Sin2  Xi  C0S^%2)  (1  -  Sin2      cos2  6/13) 

where,  as  before,  we  write      =  cos         Let  us  also  write  r,.,  =  cos  O^,,  and 
sin  ^1  cos  6'i3  =  cos  a^g,       sin  Xi  cos  (^^g  =  cos  a^g. 

Then  we  can  replace  the  above  results  by 

^3  =  0^3  sin  a^jj,  =  a-g  sin  ct^g, 

cos      =  cot  xi  cot  a^,,       cos  ©^g  =  cot  xi  cot  rt^g. 


VOL.  CC. — A. 


^         cos      —  cos  fl,i  COS  a,., 

cos  0og   =  - 

sin      sin  0^3 
E 


(liv.), 


(iv.; 


(Ivi.), 


(Ivi!.). 
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These  equations  admit  of  easy  interpretation  by  spherical  geometry. 

P 


E  F 

Let  P  be  the  pole  of  the  great  circle  DEFG.  Take  DG  =  6'o3,  DE  =  0,.^,  GF  =  dy,. 
Join  P  to  E  and  F  ;  let  the  small  circle  of  radius  round  P  meet  PD,  PE,  PF, 
and  PG  in  cZ,  e.f^g  respectively.  Draw  the  arcs  De  and  Qif.  Let  the  small  circles 
with  centres  D  and  G  and  radii  Dc  and  Gr/ respectively  meet  in  Q.  Join  DQ  and  GQ. 
Then  the  quantities  required  are  : 

So/o-o  =  sin  DQ,  ^3/<^3  =  sin  GQ. 

i-j.  =  cos  DeE,  =  cos  F/G,  =  cos  DQG. 

For  DE  =  ^10,  Ec  =  y  -  xi,  Z.  DEe  =  y  ; 

hence  :       cos  De  =  cos  Q^i  —  ^^^^  Xi  "^^s  ^^.j  =  cos  a^^j  ^i"  ^h-i  —  5 

similarly  =  Qnf. 

Next,  cos  DeE  =  cot  De  tan  eE  =  cot  ct^o  cot  Xi,  or  l_  DeE  =  0io ;  similarly  <  Yf(j—%^^. 
Lastly,  from  the  triangle  DQG  :       DQ  =  a^^,  QG  =  a^g,  and  DG  =  ^33,  but 

cos  DG  =  cos  DQ  cos  QG  +  sin  DQ  sin  QG  cos  DQG, 

cos         —  cos  «,.,  cos  ft,.,  ^  -r^r^r^  ^ 

or,  cos  DQG  =  \  '-^  =  cos  0^3 ;  or  /.  DQG  =  ©23- 

Thus  all  the  relations  can  be  expressed  in  terms  of  the  sides  and  angles  of  a  simple 
system  of  spherical  triangles.  For  the  degree  of  accuracy  generally  possible  in 
biological  and  sociological  investigations  these  triangles  can  be  solved  by  a  spherical 
trigonometer,  such  as  that  sold  by  Kreidl,  of  Prague."*  The  changes,  however, 
which  i%3  imdergoes  for  various  values  of  r^,  r^^,  are,  indeed,  far  more  difficult 
to  appreciate  as  a  whole  than  those  of  I'^o  or  Xi^.  In  order  that  they  may  be  followed 
easily,  and  in  order  to  solve  directly  to  a  degree  of  approximation  sufficient  for  many 
practical  purposes  problems  in  the  influence  of  selection  on  correlation,  my  assistant, 
Dr.  L.  N.  G.  FiLON,  has  kindly  drawn  up  the  tables  which  accompany  this  memoir. 
*  It  will  suffice  fairly  well  for  all  but  a  few  special  values  of  ri  >,  r-zs,  isi. 
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Of  course  to  bring  them  within  any  reasonable  compass  we  have  had  to  Hmit  the 
values  taken.  In  the  first  place  we  have  considered  only  eleven  grades  of  selective 
stringency  given  by 

Si/o-i  =  0,  1/10,  2/10,  3/10,  4/10,  5/10,  6/10,  7/10,  8/10,9/10,  1, 
the  corresponding  values  of  tog  in  the  tables  are  entered  as 

Ro,   Rl>   1^2=   1^3,  Rc>   ^7'  1^8.   Rs),  ^10- 

The  tables  are  calculated  for  r^g  both  positive  and  negative,  but  and  r^g  are 
always  supposed  positive.  If  r^^  ^^^^  '"is  ^^^^^  negative,  then  tog  will  be  the  same 
as  if  they  were  both  positive.  If  r^g  and  rjg  be  of  opposite  signs,  then  all  we  have 
to  do  is  to  look  out  tsg  in  the  table  in  which  r^^  has  a  sign  the  reverse  of  its  actual 
value,  and  having  found  the  corresponding  value  of  r^g,  then  change  its  sign  to  obtain 
the  actual  coefficient  of  correlation  after  selection.  This  follows,  if  r^o  be  the 
negative  coefficient,  by  writing  : 

r^s  +  r,,o\o.s^^'Xi  _  _  (-^'23)  +  (-^'13)  (^12)  sin^% 

Lastly,  it  would  clearly  be  very  laborious  to  tabulate  r^g  for  a  very  great  series  of 
values  of  rj^,  r^g,  rgg.  Accordingly  a  selection  had  to  be  made  of  these  coefficients  of 
correlation.  They  were  given  the  values  0,  -25,  '5,  75,  and  1.  These  may  be  spoken 
of  as  zero,  small,  medium,  large,  and  perfect  correlations,  and  the  ranges  0  to  "25, 
•25  to  -5,  -5  to  '75,  and  75  to  1,  as  the  ranges  of  little,  moderate,  considerable,  and 
high  correlation  respectively.  There  would  thus  appear  to  be  15  combinations  of  values 
for  ^12,  ?']3 ;  these  are  given  in  the  key  to  the  tables  as  (a),  (6),  (c),  {d) .  . .  (to),  {71),  (p), 
see  p.  63.  If  these  15  values  had  to  be  combined  with  the  10  values  (5  positive 
and  5  negative)  of  r^g  and  the  11  values  of  Si/cr^,  we  should  have  1650  entries  in  our 
tables.  But  this  number  is  much  reduced  by  the  consideration  that  the  expression 
1  —  r^^^  —  rj^g^  —  r,3^  +  ^r^^Ti^r^^  for  the  real  correlation  of  three  characters  to  be 
always  positive,  x^z  can  also  never  be  greater  than  unity.  Accordingly  all  values  of 
rgg,  7'j3,  rj2,  which  do  not  satisfy  these  conditions,  have  been  excluded  from  the  tables  ; 
they  cannot  arise  in  nature.  A  few  impossible  values  of  rgg  have  been  included  in 
the  tables,  but  these  are  placed  there  solely  for  the  purpose  of  finding  by  interpolation 
values  of  tgg,  which  are  less  than  unity.  The  following  purely  hypothetical  illustrations 
of  formula  (Ivi.)  and  the  tables  will  serve  to  indicate  their  use. 

Illustration  I. — Suppose  the  correlation  of  tibia  and  femur  with  each  other  to  be 
•8,  and  of  both  with  the  stature  to  be  "6.  How  would  their  correlation  be  altered  if 
the  variation  in  stature  were  reduced  by  selection  to  half  its  present  value  ? 

Let  sja--^  =  ju^  as  before,  and  suppose  r^g  =  R  ;  then  let  /x.^,  rjg,  -j'^g,  r^g  be  the  values 
of  the  constants  next  heloiv  the  required  values  occurring  in  the  tables,  and  giving 

E  2 
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toy  =  R  ;  let  R'  be  the  true  value  of  x.i,,  corresponding  to  the  values  /x,  +  Sfx^,  r-^,  +  Sr^o, 
^'i3  +  ^''i3'  ''-33  +  ^'■23-    Thus  we  have,  as  far  as  first  differences  : 

R'  =  R  +  10  (A^ R)  Sf.,  +  4  |(A,„^R)  Sr,3  +  (A,  R)  Br,,  +  (A.^^R)  Br,,]. 

In  our  case  jx,  =  yo,  r,o,  =  '5,  r,,  =  '5,  r,,  ~  '75,  Bfx,  =  "05,  Br,,  =  Br;,  =  '1, 
87-03  =  "05.  Further,  we  look  up  Table  IV.  (a),  and  the  nearest  case  is  (j)  under  R^, 
which  gives  R=  -67105.  We  then  see  that  (A^^R)  +  (A;.,3R)  =  difference  between 
(_;■)  and  {7/1)  cases  =  —  '21455;  (A;.^  R)  =  diflPerence  between  result  in  IV.  (a)  and 
V.  (a)  =  -32895  ;  and  lastly  (A^R)  =  difference  between  R3  and  Rg  columns  of  (j) 
row  of  Table  IV.  (a)  =  -00535.    Thus  we  find  : 

R'  =  -671050  +  -002675  -  '04291  +  -06578  =  -6966. 

The  value  by  straightforward  calculation  of  formula  (Ivi.)  is  '6981,  the  two  results 
giving  substantially  the  same  value  '7.  Thus  we  see  that  such  a  selection  would 
reduce  the  correlation  of  tibia  and  femur  by  12 '5  per  cent. 

Illustration  (11.). — Suppose  the  correlation  of  humerus  and  femur  to  be  '5,  and  of 
those  with  stature  to  '7  and  '8  respectively.  How  would  the  correlation  of  humerus 
and  femur  be  modified  by  a  selection  of  stature  given  by  sja;  =  "5  ? 

In  this  case,  fji,  =  '5,  r„  =  -5,  r„  =  -75,  r„  =  '5,  Bfi,  =  0,  Br„  =  '2,  Br,,  =  -05, 
Br.2,  =  0.  We  turn  to  Table  III.  (a)  and  take  out  (k)  under  Rg,  which  gives  us 
R=  -3192.  We  have  A,^ R  =  —  -1841  and  A,. 3R  =  —  -04185,  whence  we  find 
R'  =  -1636,  but  the  differences  of  the  table  are  too  great  at  this  point  for  the 
result  to  be  very  trustworthy,*  Suppose  we  take  jx,  and  r,,  as  before,  but  r,,  =  -75, 
^-jg  =  -75,  and  therefore  R,  to  be  found  from  (m),  =  -1351  ;  then  Br,,  =  —  -05, 
Br„  =  -05  and  A^.^  R  =  —  *1841  as  before,  A,.^  R  =  —  -2995.  Hence  we  deduce 
R' =  -1120.  The  mean  of  these  two  values  of  R'  is  -1377,  and  the  true  value 
calculated  from  (Ivi.)  is  R'  =  -1395.  Taking  -14  for  the  practical  value,  we  see  that 
the  correlation  of  humerus  and  femur  has  been  reduced  by  this  comparatively 
moderate  selection  of  stature  upwards  of  70  per  cent. ! 

Illustration  (III.). — -Suppose  a  case  in  which  humerus  and  femur  were  not  correlated, 
but  that  both  were  correlated  '7  with  stature.  What  would  be  the  effect  of 
selecting  stature  with  the  same  intensity,  i.e.,  s,/<t,  =  "5  ? 

Our  best  results  from  the  tables  will  be  to  take  R5  (m)  from  Table  I.  (a),  which 
gives  R  =  —  -7297.  We  have  then  r,,  =  r,,  =  -75,  hence  §^13=  —  -05,  §^"13=  —  -05. 
A;.jjv  is  to  be  found  from  (in)  and  (/;-)  and  =  —  -3193  =  A^-^^R,  and 

E'  =  -  -7297  +  (A,  R)  Br,,  +  (A,3R)  Br„, 

=  -  -7297  +  -1277  —  -  -6020. 

The  actual  value  by  formula  is  —  -5810. 

*  Second  differences  ought  to  be  used,  and  the  process  indicated  is  practically  equivalent  to  using 
Ihem, 
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Now  this  again  is  a  remarkable  result ;  by  selecting  an  organ  correlated  with 
two  others,  neither  of  which  are  correlated  with  each  other,  we  have  produced  a 
considerable  correlation,  and  what  is  more,  one  of  a  negative  sign. 

In  other  words,  if  humerus  and  femur  were  unrelated  to  each  other,  but  were 
related  to  stature,  then  a  selection  of  stature  would  result  in  men  of  long  femur 
havinp-  a  short  humerus,  and  vice  versa. 

Illustration  /F.— Suppose  the  correlation  between  greatest  length  and  breadth 
of  the  skull  to  be  '25,  and  between  both  and  the  auricular  height  to  be  -5.    Now  let  a 

stringent  selection,  —  =  yo,  of  height  take  place.    What  modification  will  there 

be  of  the  length  and  breadth  correlation  ?  - 
Table  II.  (a),  R^,  case  {j)  gives  us  at  once  the  result — 

ros  =  '0033. 

In  other  words,  the  correlation  between  length  and  breadth  would  be  sensibly 
destroyed  by  such  a  selection.  Thus  correlation  can  be  created  or  destroyed  or 
reversed  by  selection. 

The  above  illustrations,  hypothetical  though  they  may  be,  will  suffice  to  indicate 
how  entirely  dependent  correlation  is  upon  selection.  We  must  look  upon  coefficients 
of  correlation,  in  fact,  as  just  as  much  the  outcome  of  selection  as  coefficients  of 
variation,  standard-deviations,  or  even  the  mean  size  of  organs.  No  selection  can 
take  place,  in  the  sense  in  which  it  has  usually  been  understood  to  take  j^lace— -z^.e., 
by  a  change  of  mean  and  of  variability,  without  at  the  same  time  the  means, 
variabilities  and  the  correlations  of  all  correlated,  but  not  directly  selected,  organs 
being  varied.  This  is  true  whether  the  non-selected  organs  be  initially  correlated  or 
not  among  themselves.  We  must  always  bear  in  mind  this  all-important  funda- 
mental conception,  that  natural  or  artificial  selection,  or  even  random  sampling,  are 
in  themselves  active  factors  in  the  modification  [i.e.^  creation,  destruction,  or  reversal) 
of  correlation.  Thus  not  only  is  the  impossibility  of  the  constancy  of  correlation 
for  local  races  obvious,  but  the  primary  importance  of  insuring  that  our  samples  are 
representative,  and  not  accidentally  selected  samples,  in  all  observations  or  experiments 
on  heredity,  homotyposis,  or  organic  correlation  becomes  more  and  more  manifest. 
We  must  not  lay  too  much  stress  on  two  heredity  constants — differing,  for  example,  by 
more  than  the  probable  error  of  their  difference — unless  we  are  convinced,  which 
practically  it  will  be  difficult  to  be,  that  all  modification  of  correlation  by  unin- 
tentional and  unmarked  selection  has  really  been  avoided. 

(7.)  Let  us  now  take  the  next  most  simple  case.  J/ A,  B,  C,  D  be  four  mutually 
eorrelated  organs  [in  either  the  same  or  different  individuals),  and  a  selection  take 
place  of  A  and  B,  to  find  the  changes  in  the  characters  of  the  non-selected 
organs. 
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Let  subscripts  1,  2,  3,  4  mark  the  organs  A,  B,  C,  D  respectively;  let  A,,  /ig, 
s^,  So,  Pin  be  the  constants  which  determine  the  selection  of  A  and  B ;  and  let  us 
apply  (xlv,^  to  (xlvii.).  Here  the  coefficients  will  be  obtained  from  the  set  for  ^  =  2 
in  (xxxiii.)  and  (xliv.).    Hence  we  find  : 

+  ^^12  1-V     <^l-2J ^  ^" 


1  —  r^2~      1  —  rjj-    cTjO-o  J 

'''12")  ~  ~  ^'23^'2.t  +  ^'12  (^'l3^'34  +  ^'l-t^as) 

1  -  ''12' 

J     ''jVA^ZjVn  ^'u  ~  ^'24^'l2  /  i  \^     I     ^'23       '^13^'l2  ^24       '^14^13  /^^^ 


I     ^              —  '>'23^12  ^24  ~  ^'l4^'l-2  _■    ^14       '^'24^'l2  ^23        ^13^'l2  \   ^1^2  1  (\v  \ 

+  ^'2\l-n2^     1-V    "^"l-^'i2^      1- V-  yo-a<^2i'  •  ^ 

  o          J  '''14  ~  ^^2^*24    •"'l      1    '''24_^l^i£u  „       ^1  /Iyi'i"  \ 

S,S,rH  -  .9,cr,  |-  ^-3^  ^  +    1  -  r,,^         crj-     •     •     •  ^^^"•^^ 

^  -S?  ^             n          f ^'23  ~  '"^'l3^'l3    ^2      I     ^'l3  ~  ''^2^'23  ^       h_\  /Iviii  \ 

.,l3r,3  =  ^3^3  {"TZ^  -  +  1-,.,^         }   (ixiii.). 

o      ^            ^         f  '''24  ~  '''12^'u   %      I    ^'it  ~  '''12^24  ^      fL  1  Hviv  \ 

.,t,r,,  =  S3<x,,  [~Y^  7,  +  1  -  r,i  p^'^  r  •  •  • 

Finally,  for  the  change  of  means  of  the  non-directly  selected  organs,  we  have : 


J  _ 


13       '  lo' 


12'  23 


1  - 

''12 

0-1 

_  ^14- 

''l2'''24 

0"4 

''23  ~  '^'l2^'l3  ^ 
1  —  0-3  ^ 


1  -  ^'12' 


1  1  -  * 


''l2!l4  ""4  7, 

2  ''■2 
^^12  '^2 


(Ixv.). 
(Ixvi.). 


If  we  write  (Ixv.)  and  (Ixvi.)  in  the  form — 
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'      \  (Ixvii.), 

the  /8's  are  the  partial  regression  coefficients,  and  the  whole  solution  can  be  expressed 
in  terms  of  them.    Thus  : 

=  cr/  {1  -  ^,,r,,  -  ^,3-.3  +  A3^(^T  +  ^3^^  {vj  +  ^Pb^13^^3  ^Ij.  (Ixviii.). 

«lVl3  =  *'lO-3  1^13  ^  +  ^13)^33  J-  }  (IxX.). 

Thus  the  whole  series  of  results  can  be  easily  calculated,  if  the  regression  coefficients 
are  first  calculated. 

I  may  make  some  remarks  upon  these  results.  A  formula  equivalent  to  (Ixviii.) 
was  first  given  by  me  in  my  memoir  on  "Heredity,  Panmixia,  and  Regression" 
('Phil.  Trans.'  A,  vol.  187,  p.  303),  and  used  for  certain  problems  of  inheritance,  and 
conclusions  drawn  from  (Ixix.)  or  (Ixx.)  have  been  cited  or  indicated  in  other 
memoirs. 

Some  interesting  results  follow  at  once.  If  the  selection  be  very  stringent,  sjcr 
and  sjcr  =  0  sensibly,  then  all  correlation  between  a  selected  and  non-selected  organ 
is  destroyed.  But 

Y      _   '^'34  (1  ~  '''12'^)  ~  '^'13^14       '^'33^24  +  ''13  (^13^24  +  ^'u%)   Hxxi  ^ 

v/(l  -  ^-12^  -  '-23^  -  '13^  +  ^W,,)  v/(l  -       -  r,:^  "  'u'^  +  '^'>nr,,r,,)  ' 

This  is  what  I  have  termed  a  partial  correlation  coefficient — i.e.,  the  correlation 
between  0  and  D  when  fixed  values  are  given  to  A  and  B.  So  far  as  I  am  aware, 
such  coefficients  were  first  directly  used  by  Mr.  G.  U.  Yule  in  certain  economic 
problems.*  They  are  of  very  considerable  interest,  but  for  natural  or  artificial 
selection  are  not  quite  so  important  as  the  generalised  form  (Ixix.),  for  we  generally 
select  about  a  mean  value,  and  not  absolutely  at  it. 

It  will  be  noticed  that  the  coefficient  of  correlation  of  two  non-selected  organs 
differs  from  the  corresponding  partial  correlation  coefficient  by  terms  of  the  square 
order  in  5/cr,  but  the  coefficient  of  correlation  of  a  selected  and  non-selected  organ 

*  'Roy.  Soc.  Proc.,'  vol.  60,  pp.  485,  488;  'Economic  Jounuil,'  December,  1895,  and  December,  1896. 
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differs  from  zero  by  terms  of  the  first  order  in  sjcr.  Hence,  when  selection  is  intense 
(s/cr  small),  we  may  neglect  the  former  as  compared  with  tlie  latter,  and  we  have 
thus  the  basis  of  a  method  of  approximation  very  useful  in  some  cases. 

I  have  not  yet  succeeded  in  giving  a  geometrical  interpretation  to  the  above 
formulae,  but  have  little  doubt  that  it  would  be  quite  easy  if  the  "  spherical " 
geometry  of  four  dimensioned  space  were  more  familiar  to  me.  It  will  suffice  to 
observe  that  it  is  easy  to  find  cases  in  which  the  correlation  of  a  non-directly 
selected  organ  with  a  directly  selected  organ,  or  with  another  of  its  own  class,  can 
be  reduced,  destroyed,  increased,  or  reversed.  In  fact,  all  our  previous  warnings  as 
to  the  caution  necessary  in  avoiding  unintentional  selection  in  collecting  material  for 
testing  correlation  remain  in  force,  and,  indeed,  are  emphasised. 

The  following  illustrations  will  indicate  the  kind  of  problems  which  may  be 
attacked  by  such  formulae  as  the  above  : — 


(8).  Illustration  I. — Injiuence  of  a  Selection  of  the  Long  Bones  of  the  Leg  on  the 
Long  Bones  of  the  Arm,  and  on  the  Relation  of  the  Leg  to  the  Arm. 

A  numerical  example  will  throw  light  on  the  application  of  the  above  formulae,  and 
effectively  illustrate  the  manner  in  which  a  selection  influences  size,  variation,  and 
correlation. 

Consider  the  long  bones  femur,  tibia,  humerus,  and  radius,  indicated  by  the 
subscripts  1,  2,  3,  4  respectively,  and  let  m-^,  m^,  m.^,  wi^  be  the  mean  values  in 
centimetres.  Then  the  following  numerical  values  are  given  in  a  memoir  by  Miss 
Alice  Lee  and  myself : — * 


French 

Aino 

mi 

45-23 

40^77 

m-2 

36-81 

33  •SO 

ms 

33-01 

29^50 

24-39 

21  ^55 

2-37 

1-90 

1-80 

1-67 

1-54 

1-34 

1-17 

1-06 

r\2 

•806 

■827 

^'13 

•842 

•858 

ru 

•744 

•789 

''••23 

•860 

•745 

•780 

•865 

•845 

•776 

*  '  Roy.  Soc.  Proc.,'  vol.  61,  p.  343  et  seq.  The  correlations  are  not  worked  out  for  exactly  the  same 
lengths  in  the  case  of  the  two  races,  but  the  numbers  will  serve  quite  well  for  the  purpose.s  of  illustration. 
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Now  let  us  select  from  the  French  population  a  group  having  the  same 
characteristics  of  the  long  bones  of  the  leg  as  the  Aino  population,  and  then  compare 
the  characteristics  of  the  arm  bones  of  this  selected  group  with  those  of  the  Aino 
population. 

Our  selection  is  given  by  : 

=  40-77-45-23  =  -  4-46, 

=  33-89-36-8I  =  -  2-92, 
5i  =    1-90  =  -827, 

s,=  1-67. 

The  following  constants  must  now  be  determined  arithmetically  : 

sja,  =  r90/2-37  =  '802, 
^2/0-3=  1-67/1 -80  =  -928, 

1  —  rj3^  1  —  ''la" 

ntziZiZM  ^  .329,  'j^^ZlJ^=-515,- 
1  —  1  ~"  '^13 

=  -196, 


1  —  r  2 

^12 


1  - 


=  -125. 


If  ic'g  and  x\  be  the  mean  humerus  and  radius  oi  a  femur-tibia  selection  from  the 
French  population,  we  have  from  (Ixv.) — 

.x'g  =  33-01  -j-  -277/^1  +  -443^3, 
x\  =  24-39  +  -I62/ii  +  -335/12. 

These  would  give  the  effect  of  selecting  any  femur  and  tibia  defined  by  and 
from  the  mean  values  of  the  humerus  and  radius.     For  the  particular  selection 
indicated  above  : 

cc'g'-.  30-48,  =  22-69, 

both  of  which  are  about  a  centimetre  in  excess  of  the  Aino  population.    By  selecting, 
therefore,  from  the  French,  a  population  with  a  mean  leg  like  the  Aino,  we  should 
still  find  the  average  arm  of  this  population  some  two  centimetres  greater  in  length 
VOL.  cc. — A.  F 
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than  the  Aino.    The  variabihties  So  and  Sj.  of  humerus  and  radius  for  a  population 
selected  from  the  French  by  femur  and  tibia  are  obtained  from  (Ixiii.)  and  (lix.). 
We  have  : 

t.'lai  =  -196  +  -181  f    Y  +  -268  {    )'  +  2  p,,  x  -221, 

S,V<./  =  -354  +  -108  fi)'  +  -205       J  +  ,  i  i,,^  x  'l  Gi).  ' 

These  give  for  the  particular  case  : 

^3=  1-39,  1-11. 
Turning  to  the  correlation  of  humerus  and  radius,  we  have  by  (Ix.)  : 

=  ^      {-125  +  -140  l^Y  +  "267  ('^  +  -389  -p,,  ^  ^  j, 

giving  for  the  particular  case  : 

rai  =  -799. 

It  will  thus  be  seen  that  if  we  selected  from  the  French  a  group  with  the  same 
variabilities  and  correlation  of  femur  and  tibia  as  the  Aino,  the  variabilities  and 
correlation  of  the  humerus  and  radius  of  this  group  would  not  be  very  different  from 
those  of  the  Aino.  On  the  other  hand,  the  correlations  between  upper  and  lower 
members  would  be  very  significantly  different. 

Generally  we  have  by  (Ixi.)  for  selection  from  the  French  : 

•^»=i:{-*26l+-518p^^}. 

^-  =  i,{-'''i  +  -'''''^-  ' 

These  yield  for  our  particular  case  : 

1-13=  -819,  x,i  =  -694, 
1-3,  =  -845,       r^^  =  768. 

These  are  all  smaller  than  the  corresponding  French  values,  the  selection  has 
reduced  the  correlation,  but  the  Aino  population  has  in  all  the  cases  but  r^^  a  greater 
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correlation  than  the  French.  We  must  accordmgly  conclude  that  by  a  leg  selection 
from  the  French  aimed  at  reproducing  the  proportions  of  the  Aino  leg,  we  should  not 
obtain  an  arm  equivalent  to  the  Aino  arm.  The  divergences  are  indicated  in  the 
accompanying  table  :  — 


Selection  from  the  French. 

Aino. 

Unselected 
French. 

Mean  humerus    ...  ... 

30 

48 

29-50 

33-01 

22 

69 

21-55 

24^39 

Varialnlity  of  humerus  

1 

39 

1-34 

1-54 

1 

11 

1-06 

1-17 

Correlation  of  humerus  and  radius  . 

799 

•776 

•845 

,,       ,,  femur  and  humerus  . 

819 

•858 

•842 

,,       ,,  femur  and  radius 

694 

•789 

■744 

,,        ,,  tibia  and  humerus  . 

845 

•745 

-860 

,,       ,,  tibia  and  radius   .  . 

768 

•865 

-780 

There  is,  of  course,  no  special  reason  for  supposing  that  the  French  and  Aino 
differ  merely  by  an  evolution  which  has  acted  by  selection  of  femur  and  tibia.  We 
might  have  obtained  a  race  out  of  the  French  more  nearly  akin  to  the  Aino  by  a 
selection  of  femur  and  humerus,  but  the  process  would  numerically  be  exactly 
similar.  The  p-articular  illustration  here  chosen  is  taken  merely  as  an  instance,  to 
indicate  how  the  methods  developed  in  this  memoir  enable  us  to  ascertain  with 
quantitative  certainty  how  far  racial  differences  may  be  due  to  the  more  or  less 
stringent  selection  of  a  limited  number  of  organs  in  the  one  race. 

If  we  consider  that  local  races  have  been  differentiated  from  a  parent  stock  by 
the  selection  of  the  chief  or  more  markedly  divergent  organs,  then  we  have  in 
processes  such  as  that  just  illustrated  a  method  of  ascertaining,  at  least  tentatively, 
whether  two  races  are  to  be  considered  as  merely  local  varieties,  and  further  the 
particular  organs  through  selection  of  which  the  differentiation  has  taken  place. 

Illustration  II. — Influence  of  a  Selection  of  Femur  and  Humerus  in  Modifying 

Stature. 

The  following  data  have  been  calculated  for  me  by  Miss  Alice  Lee  from  Rollet's 
measurements  on  the  French  ; — * 

*  They  have  been  undertaking,  with  the  view  of  determining  more  scientifically  than  appears  to  me 
yet  to  have  been  done,  the  mean  stature  of  a  race  from  a  measurement  of  the  long  bones  found  in  burial 
mounds,  &c.  RoLLETb  measurements  are  given  in  '  De  la  Mensuration  des  Os  longs  des  Membres,' 
Lyons,  1889.  I  hope  shortly  to  publish  a  memoir  on  the  subject.  [The  memoir  in  question  was  published 
in  'Phil.  Trans.,'  A,  vol.  192,  pp.  169-244,  1898.] 
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? 

166 

•26  cms. 

154 

•02  cms. 

5 

50 

5 

•45  „ 

45 

23  „ 

41 

•57  „ 

2 

37  „ 

2 

26  „ 

33 

01  „ 

29 

77  „ 

1 

54  „ 

1 

53  „ 

Correlation,  femur  and  humerus,  r.23  .  . 

842 

872 

,,         stature  and  humerus,  r^s 

809 

771 

,,         stature  and  femur,  ri->    .  . 

811 

805 

Now  let  us  select  from  among  French  males  a  group  having  the  same  variability, 
correlation,  and  mean  size  of  humerus  and  femur  as  French  females,  and  let  us  ask 
how  this  would  alter  the  variability  (Sj),  mean  size  (Mj)  of  stature  in  French 
males,  and  also  the  correlation  between  stature  and  humerus  (tja)  and  stature  and 
femur  (rio). 

We  have  at  once  from  the  second  column — 

A,  --  —  3-66,  A3  =  —  3-24, 

6-.  =  2-26,  .S3  =  1-53, 


whence  we  find. 


-^-^=•447,   ^=-433. 

1  - 


^  —  '23  '23 

From  (Ixv.)  we  deduce, 

Ml  =  166-26  +  l-037Ao  +  l-546/)3. 

This  fbrmida  gives  the  stature  of  any  group  of  males  selected  from  the  French, 
and  having  their  femur  and  humerus  respectively  hc>  and  centims.  longer  than  the 
average. 

For  the  special  selection  referred  to  above,      =  —  3'66  and     =  —  3 '24,  hence 

Ml  =  166-26  —  3-79  —  5-01  =  157-46. 

This  example  shows  us  that  if  we  selected  French  men  with  the  same  femur  and 
humerus  as  French  women,  it  would  be  the  selection  of  the  humerus  which  would 
contribute  mostly  to  the  reduction  of  stature — a  somewhat  singular  result.  Further, 
such  a  selected  group  of  French  men  would  be  still  some  3^  centims.  taller  than  the 
average  of  French  women  (instead  of  about  12^  centims.).  Probably  had  we 
selected  the  tibia  as  well,  the  greater  portion  of  this  remaining  advantage  in  height 
would  have  disappeared. 
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To  find  the  A  ciriability  in  stature  of  the  selected  group  we  nuist  use  (Iviii.).  We 
deduce : 

V  =  -1=  {  -287  +  •200(;;y  +  -187  (^J  +  -SST  ^^Ip]. 

In  our  particular  case  this  gives  : 

ty  =  <x,X  -987  =  5-43. 

The  actual  variability  in  stature  of  French  women  is  measured  by  a  standard 
deviation  of  5-45.  Hence  our  selected  group  of  men  would  be  sensibly  equally 
variable  with  French  women,  as  far  as  absolute  variation  is  concerned. 

Lastly,  from  (Ixi.) : 

r,3  =  ^  {-433  :;  +  -447  ^4 
which  give  in  our  particular  case  : 

r^,  =     X  -8011  =  -811, 

ri3  =  5  X         =  '^12. 

Such  a  selection,  therefore,  would  accordingly  only  increase  insensibly  the  corre- 
lation between  stature  and  humerus,  while  leaving  that  between  stature  and  femur 
the  same.  The  sensible  reduction  of  correlation  between  stature  and  humerus  ("809 
to  "771),  which  is  found  as  we  pass  from  male  to  female,  does  not  arise  when  we 
select  a  group  of  males  with  their  femur  and  humerus  of  the  same  length,  variation, 
and  correlation  as  those  of  the  females. 

Illustration  III. — Injluence  of  a  Selection  of  Stature  in  Modifying  Femur 

and  Humerus. 

Let  us  select  a  group  of  French  men  having  the  same  height  and  variability  in 
height  as  French  women,  and  calculate  the  changes  which  will  arise  in  their  femur 
and  humerus. 

Here  the  selection  is  given  by  : 

=  —  12-24,        Sy  =  5-45. 
We  now  need  only  the  earlier  formulae  of  this  memoir.    From  (1.)  we  find 


M3  =  Wo  +  -349/61,       M3  =  m.^  +  -227Ai. 


38 


PROFESSOR  K.  PEARSON  ON  THE  INFLUENCE  OF  NATURAL 


These  give  for  our  case  : 

M^=  40-95,       M3  =  30-24. 

Thus  a  group  of  males,  selected  to  have  tlie  same  stature  as  the  females,  would 
have  a  slightly  shorter  femur  and  a  slightly  longer  humerus.  A  slightly  longer  femur 
in  woman  and  a  slightly  longer  humerus  in  man  would  thus  appear  to  be  sexual 
characters. 

Turning  to  the  variations,  these  are  given  by  (xlix.).    We  find  : 

So  =  2-36,        S3  =1-53. 

This  shows  us  that  while  the  selection  would  give  the  same  variability  of  humerus 
to  the  men  that  women  have,  it  would  fail  to  produce  the  reduction  of  variability  in 
the  femur,  which  is  characteristic  of  the  women. 

From  (li.)  we  deduce 

=  -808,       r,.,  =  -806, 

while  from  (Ivi.)  we  have 

=  -840. 

Thus  we  see  that  very  small  clianges  would  be  made  in  the  correlations,  stature 
and  femur,  stature  and  humerus,  and  fenun-  and  humerus,  if  we  selected  French  men 
to  have  the  same  size  and  variability  of  stature  as  French  women.  The  explanation 
of  this  lies  in  the  nearly  equal  absolute  variability  of  the  two  sexes  with  regard  to 
stature,  for,  as  we  have  seen,  it  is  the  selection  of  variability  which  modifies  correlation. 
Looking  at  the  table  of  values  on  p.  36,  we  see  that  the  largest  difference  of 
variability  in  the  two  sexes  lies  in  the  femur,  and  accordingly  it  is  from  a  selection  of 
femur  that  we  should  expect  the  greatest  differences  in  the  variability  and  correlation 
of  the  two  sexes  to  have  arisen,  but  even  this  difference  alone  would  not  account  for 
the  observed  sexual  differences  in  the  correlation.  Indeed,  it  would  be  surprising  if  it 
did,  for  the  selection  of  other  organs,  notably  the  pelvis,  must  have  played  a  con- 
siderable part  in  the  differentiation  of  sex.* 

(9.)  I  shall  now  proceed  to  a  series  of  problems,  which  will  show  the  application  of 
results,  such  as  those  obtained  in  this  memoir,  to  questions  which  arise  in  dealing 
with  inheritance  and  selection.  If  we  suppose  a  general  population  to  have  statistical 
"  constants,"  which  remain  constant  at  any  rate  for  a  moderate  interval,  we  still  want 
to  know  not  only  tlie  error  which  may  arise  from  a  random  sampling,  but  also  the 
sort  of  effect  which  results  from  our  sample  being  too  much  drawn  from  one  kind  of 
environment,  from  a  rather  limited  class,  or  from  any  other  practically  necessary  or 
unconsciously  introduced  limitation  of  the  random  character  of  our  sample. 

*  "  Primitive  man  and  woman  are  more  nearly  equal  in  size,  varialiility,  and  correlation  than  highly 
civilized  man  and  woman  "  ('Roy.  Soc.  Proc.,'  vol.  61,  p.  354). 
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Illustration  I. — To  find  the  Influence  on  the  Intensity  of  Parental  Heyediiy  of 

the  Selection  of  Parents. 

Let  the  subscripts  1,  2,  3  refer  resiDectively  to  father,  mother,  child.  Let  us  first 
select  one  parent — say,  the  father — very  stringently,  i.e.,  sjcr^  =  is  small.  Then  we 
need  only  equations  (xhx.)  and  (li.).    These  give  us  : 

and 

The  first  may  be  written 

 ^3  

or,  we  see  that  rj,^  will  decrease,  as  /x^  decreases.    Thus  if  r^^^  —  "4  we  have  for 

t^i  =  1/2, 
^1  =  1/4, 

It  is  clear,  therefore,  that  the  correlation  of  parent  and  child  will  be  much  reduced  by 
such  a  selection.  On  the  other  hand,  the  regression  coefficient  will  not  be  altered,  i.e., 
^sl^isA'i  —  o^2^'i3/^i>  w6  have  seen.  Hence  in  problems  of  heredity,  where  we 
suspect  a  parent  to  have  been  highly  selected,  we  should  seek  for  the  regression  of  son 
on  parent  rather  than  for  the  correlation.  Thus  in  the  case  of  Basset  Hounds,*  some 
if  not  all  the  reduction  in  correlation  between  sire  and  offspring  may  be  due  to 
selection  of  the  sire.  A  test  of  whether  the  reduction  in  correlation  is  due  to 
selection  of  a  parent  ought  to  be  given  by  a  comparison  of  s^  and  So.  We  cannot, 
I  think,  suppose,  unless  natural  selection  be  very  stringent,  that  cto  differs  much  from 
(Ty  Hence  it  follows  that  ^Js^  ought  generally  to  be  large,  if  there  be  selection  of 
a  parent.  We  can  hardly  test  this  point  effectively  in  the  case  of  the  Basset  Hounds, 
owing  to  the  nature  of  the  classification.  In  racehorses,  although  the  sire  appears 
to  be  far  more  selected  than  the  dam,  there  is  not  a  great  reduction  of  the  coefficient 
of  correlation  between  sire  and  offspring, f  s-^  appears  to  be  less  than  ^o,  but  not  so 
greatly  and  certainly  less,  that  we  can  be  surprised  that  thp  correlation  of  sire  and 
offspring  is  not  much  less  than  we  have  found  it  for  material  in  which  selection  of  the 
father  is  certainly  far  less  marked.    We  must  accept  the  warning  as  to  the  reduction 

*  '  Roy.  Soc.  Proc.,'  vol.  66,  p.  L57. 
t  See  'Phil.  Trans.,'  A,  vol.  195,  p.  93. 


1-13  =  -2132,  tja.  =  -8367, 

=  -1085,  Vo-e  =  '7'')06, 

r^o  =  -0545,  So/o-,2  =  -7786. 
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ill  correlation  produced  by  the  stringent  selection  of  one  parent,  but  we  must 
remember  the  complexity  of  the  factors — the  variety  of  other  influences  at  work  in 
selecting  and  modifying  selection — before  we  lay  much  stress  on  this  source  of 
alteration  in  parental  correlation. 

Now  let  us  deal  with  the  case  of  both  parents  selected,  and  suppose  their  selection 
given  by  Si/cr^  =  /x^,  sja-.^  =  /xo,  and  the  change  of  their  coefficient  of  assortative 
mating  from  rj,,  to  We  have  from  formulae  (Ixviii.)  and  (Ixx.)  by  a  little 

rearranging 

^3'  =  0-3^1  -  A3'  (1  -  ^1")  -  A3'  (1  -  H^i)  -  ^  (^'b  -  Pii^i/^s)  A3A3} 

^31^13  =  0-3  (^'i3  -  (1  -  /^i)  A3  -  (''is  —  A3}  (Ixxii.), 

where  /S^g  =  {r-^^  ~  u^r^.^)  /  (1  —  *"i2~),  A3  =  (''23  ~  '^n^'n)  /  (1  ~  '^'is^)-  Now  let  us  take 
special  cases  to  bring  out  points.  Let  us  suppose  rjo  =  0,  or  no  assortative  mating  to 
exist,  and  let  us  enquire  what  change  would  be  made  in  parental  correlation  if  we 
selected  parents  who  had  assortatively  mated,  without  altering  their  variability,  i.e., 
let  us  take  f^i  =  1,  /a^  =  1,  we  have  at  once 


S3"  =  0-3^1  +  '^Pi-zVzi) 

^sl'lS  =  0-3  (^'13  +  Pvz^'-z^)  ■ 


(Ixxiii.), 
(Ixxiv.), 


for  = 

•4  and 

Pu  = 

•1, 

^3/0-3 

•0149, 

1-13  =  -4335, 

Pie  = 

•2, 

=  1 

•0315, 

=  -4653, 

Pu  = 

■3, 

•0469, 

=  -4967, 

Piz  = 

•4, 

•0621, 

=  -5271, 

Pu  = 

•5, 

•0770, 

=  -5571, 

Pi^  = 

1, 

1489, 

=  -6963. 

Thus  the  general  effect  of  assortative  pairing  of  parents  is  to  increase  the  correlation 
between  parent  and  offspring  sensibly,  but  not  to  very  rapidly  increase  the  variability 
of  the  offspring.  Thus  marriages  within  a  class  would,  if  heredity  statistics  were 
collected  for  a  class,  tend  to  show  increased  parental  correlation.  Very  high  assorta- 
tive mating  no  doubt  occurs  with  some  forms  of  breeding,  and  we  may  well  find  in 
such  cases  higher  values  of  the  parental  heredity  than  we  should  obtain  for  a  popula- 
tion of  the  same  species  with  random  mating.  I  think  this  may  be  an  effective  factor 
in  the  raising  of  the  parental  correlation  in  the  case  of  coat-colour  in  thoroughbred 
horses. 
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Now  let  us  see  what  happens  if  we  select  both  parents  moderately.    As  test  c^ses, 
let  ns  take  /x^  =  ,u.  =  '8  and  -5,  and  for  the  extreme  =  0. 
We  have  at  once 

$3  =  (r3v/-68  +  '3211^^1  -Fpn)  (Ixxv.), 

^^31-13  =  0-3  {•4/^1  (1  +  pu)]  (Ixxvi.). 

Hence  we  deduce 


f'l  =^[J-2  =  ■  8. 

•5. 

/Xi=/X2 

=  0. 

23/0-3. 

ri3- 

23/0-3. 

23/0-3. 

ri3. 

•1 

•9515 

•3700 

■8764 

•2865 

■8246 

0 

•2 

•9G22 

•3991 

•8809 

■3093 

•8246 

0 

•3 

•9727 

•4277 

•8854 

•3316 

•8246 

0 

•■1 

•9832 

•  4556 

•8899 

•3535 

•8246 

0 

•3 

•9936 

■4831 

•8944 

•3750 

•8246 

0 

1 

1-0438 

•6131 

•9165 

•4762 

•8246 

0 

This  table  is  very  instructive.  It  shows  us  that  selection  and  assortative  mating 
are  factors  of  opposite  influence ;  that  if  selection  be  only  moderate,  then  with 
considerable  assortative  mating  the  coefficient  of  parental  correlation  may  be 
increased,  but  if  selection  be  stringent,  then  assortative  mating  cannot  counteract, 
even  if  as  high  as  "5,  its  destructive  influence  on  parental  correlation. 

For  example,  if  we  take  parents  remarkable  for  some  intellectual  or  physical 
chai'acter,  say  with  a  variability  only  a  very  small  fraction  of  that  of  the  general 
population,  then,  however  proportionately  we  might  pair  them,  we  should  find  their 
relationship  to  their  children,  as  measured  by  the  coefficient  of  correlation,  very 
sensibly  reduced  below  that  of  the  general  population.  I  think  we  have  here  the 
reason  why  Mr.  Galton's  Family  Data,  which  were  drawn  from  a  rather  narrow 
class,  and  had  only  a  small  coefficient  of  assortative  mating,  give  so  much  smaller 
parental  correlation  than  my  own  Family  Data,  which  seem  to  me  drawn  from  a 
wider  class,  and  have  a  considerably  higher  assortative  mating. "^^^ 

It  will  be  clear  that  with  factors  like  assortative  mating,  natural  selection,  artificial 
selection  of  breeders,  unconscious  selection  of  material  from  one  class  or  one  environ- 
ment, modifying  our  coefficients  of  heredity  in  one  or  another  direction,  we  can  hardly 
hope  for  more  in  practical  statistics  than  an  approximation  to  the  strength  of  the 
pure  inheritance  factor  by  dealing  w^ith  the  average  of  as  many  races  and  characters 
as  possible. 

*  The  work  for  Mr.  Galton's  Family  Data  is  given,  '  Phil.  Trans.,'  A,  vol.  187,  p.  270.     My  own 
results  are  as  yet  unpnlilished.    The  average  value  is  about  •  45,  as  compared  with  Mr.  Gat.ton's  ■  34 
VOL,  CC. — A.  G 
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Illustration  II. — To  Find  the  Influence  of  Parental  Selection  on  Modifying 

Fraternal  Correlation. 

Let  the  subscripts  1,  2  represent  the  parents,  and  3  and  4  two  of  their  offspring. 
Let  us  first  select  one  parent  only,  the  selection  being  given  as  before  by  sJ(Ti  =  fii- 
Our  formulge  will  now  be  (liv.)  and  (Ivi.).  So  far  as  the  change  in  variability  is 
concerned,  we  have  already  discussed  it  under  our  first  illustration,  so  we  need  only 
consider  : 

~  V'il -a- -(!-/.?)  V}    •     •     •  • 
Now       "  r,,,  if  the  offspring  are  of  one  sex  ;  hence  : 

• (Ixxvm.). 

If  we  take  r^^  —  "4  and  7^34  =  '5  as  reasonable  values,  we  have 

1.31-784^.16^2  Uxxix.;. 

Thus  1*34  "^^i^^l  t)e  greatest  when  fi^  is  greatest,  i.e.,  when  there  is  no  selection,  and 
will  decrease  with  /x^  until  it  reaches  '4048,  when  /x^  =  0,  or  there  is  selection  of 
fathers  of  one  value  of  the  character  only.* 

The  selection  of  one  parent  only  does  not,  therefore,  immensely  modify  the 
correlation  of  brothers.  Still,  if  we  work  sensibly  with  one  class  of  the  community — 
say,  men  of  genius — we  should  expect  to  find  their  sons  rather  less  like  each  other 
than  if  we  worked  with  the  general  population  of  brothers. 

Now  let  us  select  both  parents.  Here  again  the  variability  of  the  offspring  has 
already  been  dealt  with.  We  are  concerned  with  equation  (Ixix.),  and  we  shall  put 
^'1.3  —  —  ^'33  —  ^'24.  —  make  parental  influence  equipotent  for  the  two  sexes. 
Hence 


where  r  is  the  parental  correlation,  and  r^.j  the  coeflacient  of  assortative  mating. 
Hence  we  find 

_  ^'34  -  /3"  {1  -  /^^  +  1  -       +  2  (r^a  -  Pu^l^^^^i)}  /1vyy\ 

To  reduce  to  numbers,  suppose  =  ju,^,  and  r^c,  =  0  for  the  general  population.  We 
have 

*  In  general  the  value  of  rst  ranges  from  r^i  down  to  • 

1  -  ns^ 
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Hence  if  we  put  r^^  =  -5,  and  r  =  '4, 

_  -18  + -32/^^1  +  .. 
^34,  -  .gg  ^  .32^3(1  +  p^^^  Uxxxn.;. 

The  following  table  will  suffice  to  indicate  tlie  changes  which  take  place,  when  we 
give  a  series  of  values  to  and  p^.,.  Thus  the  first  row  gives  the  influence  of  selecting 
parents  without  any  assortative  mating.  We  see  that  with  increasing  stringency  of 
selection  the  reduction  of  correlation  is  very  considerable,  and  that  with  such  selection 
the  influence  of  assortative  mating  becomes  less  and  less.  Nevertheless,  assortative 
mating  can  produce  quite  sensible  results,  if  there  be  little  or  no  selection.  I  am, 
indeed,  inclined  to  think  that  a  good  deal  of  the  high  values  found  for  the  fraternal 
colour  correlation  in  the  thoroughbred  foals*  is  due  to  much  assortative  colour  mating 
in  sire  Snd  dam'.    Of  course  it  cannot  be  all  due  to  this  source. 

Values  of  Fraternal  Correlation  with  Parental  Selection. 


/^i=  '8. 

III  =  •  6. 

IH  =  •  4. 

/^i=-2. 

IH  =  0. 

•5000 

•4349 

•3712 

•3162 

•2783 

•2647 

Pl2  =  ■  1 

•5155 

•4477 

•3802 

•3209 

•2796 

•2647 

Pi->=-2 

•5301 

•  4599 

•3889 

•3256 

•2809 

•2647 

PV2.  =  '3 

•5438 

•4716 

•3974 

•3303 

•  2823 

•2647 

Pi-2  =  ■  5 

•5690 

•4935 

•4137 

•3392 

•2849 

•2647 

l>v>=l 

•6212 

•5411 

•4508 

•3609 

•2914 

•2647 

On  the  whole,  I  think,  we  may  conclude,  so  far  as  tlie  relative  influences  of  sexual 
selection  in  the  form  of  assortative  mating  and  natural  selection  go,  that : 

Both  sexual  and  natural  selection  can  sensibly  modify  the  intensity  of  inheritance 
as  measured  hy  the  coefficient  of  correlation,  the  former  tends  to  raise,  the  latter  to 
lower,  its  intensity.  But  the  effect  of  the  latter,  if  at  all  stringent,  is  to  comjjletely 
mask  the  effect  of  the  former. 

In  fact,  we  may  write 

l-2r2  +  2rV(l  +  P:,)' 

Hence  the  smaller  -|- p^^),  the  smaller  will  be  fraternal  correlation.  This 

varies  as  the  square  of  fi^  and  only  as  the  linear  power  of  1  +  p,o.  Thus  we  see  at  once 
why  stringency  of  selection  is  far  more  potent  than  assortative  mating. 

Illustration  III. — To  fiud^  the  iyifluence  of  selecting  ttvo  organs  A  and  B  in 
d  parent,  on  the  correlation  of  the  like  organs  A'  and  B'  in  the  offspring. 

Let  the  organs  in  the  parent  be  denoted  by  1  and  2,  and  in  the  oflspring  by  3  and  4. 
Suppose  the  organic  correlation  of  the  two  organs  in  the  general  population  to  be  r', 

*  'Phil.  Trans.,'  A,  vol.  195,  p.  93. 
G  2 
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so  that  =  r-gi,  =  r'  before  any  selection  takes  place.  Let  r,  the  correlation  of  the 
organs  in  the  parent  and  offspring  he  supposed  to  be  the  same  for  both  organs  ;  then 
r  =  =  r.^.  Finally  we  have  the  coefficients  of  cross-heredity,  r^^  and  r^^.  These 
must  vanish  if  there  be  no  heredity  and  no  organic  correlation,  and  should  be  perfect 
if  both  these  are  perfect.  Hence  we  will  take  r^^  =  r.^^  =  rr'  as  a  probable  hypothesis.* 
With  these  values  of  the  correlation  coefficients  we  easily  find 


Hence  from  (Ixviii.)  and  (Ixix.)  we  have  : 


1*31.  = 


(Ixxxiii.), 
(Ixxxiv.). 


For  simplicity,  suppose  the  stringency  of  the  selection  to  be  the  same  for  both 


organs,  then  : 


1  -  r2 


1 


1  -  r2 


If  r  =  "4,  and  p^.,  =  yr', 


1  +  WT/^r 

The  following  table  indicates  the  value  of  X->,i/r' : — 


(Ixxxv.), 


^1  =  L 

Mi=  -8. 

^1  =  •  6. 

/xi=-2. 

/^i  =  0. 

7  =  1 

1 

1 

1 

1 

1 

7  =  0 

8 

•9680 

•9783 

•9872 

•9941 

•9985 

7  =  0 

6 

■9360 

•9565 

•9743 

•9882 

•9970 

7  =  0 

4 

•9040 

•9348 

•9615 

•9823 

•  9955 

7  =  0 

2 

•8720 

•9131 

•9487 

•9763 

•9940 

7  =  0 

0 

•8400 

•8913 

•9358 

•9704 

•9924 

It  will  be  clear  from  this  table  that  if  the  selection  be  at  all  stringent,  no  reduction 
of  organic  correlation  in  the  parents  will  affect  substantially  the  organic  correlation  in 
the  offspring. 

On  the  other  hand,  if  y  be  >  1,  we  can  have  considerable  modifications  in  the 
value  of  the  correlation,  even  if  the  selection  be  stringent, 

*  See  '  Roy.  Soe.  Proc.,'  vol.  62,  p.  411.  I  have  sx  good  deal  of  data  on  the  Value  of  these  crOss4eredity 
correlations  now  reduced  and  soon  to  be  published. 
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/Xl=  1. 

^>4=  -8. 

/ii=  -6. 

Mi=  -4. 

/xi=-2. 

y=  2 
=  10 
=  50 

1-  1600 

2-  4400 
8-8400 

1-1087 
1-9779 
6-3243 

1-0642 
1-5775 
4-1448 

1-0296 

1-  2662 

2-  4492 

1-0076 
1-0681 
1-3705 

1 
1 
1 

Lastly,  if  r'  =  0  : 

1 


hiJpvi  is  given  by  : 


/ii  =  1. 

Mi=  -8. 

/xi  =  •  6. 

/x.i=  -4. 

Mi=-2. 

/^i  =  0. 

Xzilpi-i 

-1600 

-1087 

-0642 

•0296 

•0076 

0 

Thus,  even  if  there  were  no  correlation  between  the  organs  A  and  B  in  the  general 
population,  still  a  selection  of  parents  in  which  such  organs  were  correlated  would 
lead  to  offspring  with  correlated  organs  A'  and  B'.  The  amount  of  such  correlation 
would  only  be  '1600,  if  the  variability  of  the  parent  were  not  selected,  and  would 
diminish  rapidly  with  stringent  selection  of  variability.  Still  'IGOO  is  quite  sensible, 
and  would,  if  the  selection  continued  for  a  few  generations,  continue  to  increase. 
Thus  we  see  how^  selection  of  a  pair  of  organs  in  a  parent  may  increase  or  even  create 
correlation  between  the  like  organs  in  the  offspring. 

The  reader  will  find  other  interesting  illustrations  in  tracing  the  influence  of  an 
absolute  selection  of  one  parent  only  on  the  correlation  of  the  offspring,  e.g. ,  relation 
between  pairs  of  foals  which  all  have  a  common  sire,  the  influence  of  selecting  an  organ 
A  in  the  sire  and  an  organ  B  in  the  dam  on  the  correlation  of  the  organs  A  and  B  in 
the  offspring,  the  influence  on  assortative  mating  of  selecting  parents  of  men  of 
genius,*  and  in  many  other  problems. 

(10.)  It  is  not  without  value  to  consider  how  p^.j  arises  in  the  case  of  natural  or 
artificial  selection.  Suppose  we  have  two  organs,  A  and  B,  then  we  shall  endeavour 
(i.)  to  give  these  definite  values,  say  x  and  but  we  shall  not  be  able  to  get  all  our 
individuals  with  such  absolute  values,  we  shall  select  with  certain  deviations  from 
X  and  y,  given  by  x  —  x-\'x'  and  y  r=iy     y\  say.    Further,  we  shall  endeavour  to 

*  This  is  a  peculim  ly  interesting  case,  for  if  we  select  men  of  remarkable  intellectual  ability,  we  should 
expect  to  find  both  parents  above  the  average  of  the  general  population,  but  with  a  negative  correlation 
between  them  amounting  at  a  maximum  to  -  •  1905. 
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make  y  some  function  of  £c,  say  y  =  f{x)  or  y  +  y'  —  f{oc)  +  x  f  {x)  +  —  f"  {x)  + . . . 
by  Taylor's  theorem.    But  y  =f  (x)  and  x  is  small,  so  that  our  attempted  relation 

will  be  of  the  form  : 

—  -  y'  =  mx'. 

Here  m  is  the  slope  of  a  line  to  which  we  endeavour  to  confine  the  selected  organs. 
But  we  shall  not  he  quite  able  to  exactly  hit  this  relation  off ;  when  x  —  e,  we  shall 
find  that  y'  =  me  -j-  7^,  where  17  is  a  small  unavoidable  error  in  selection  of  y'  uncor- 
I'elated  with  e.    Thus,  if     and  Sr,  be  the  selected  variabilities,  we  shall  have  : 

sj^  =  -  S  {i/-)  -  ^  S  (nu  +  yjf,         =  ^-  S  (e-). 

n     ^'    '       n.     ^  "         ^        11      ^  ' 

Therefore  :  sj''  =  on^s^^  ^~  ^  ^  ^'^')" 

Further  •  —  ^  ^^''''"^  =  ^{(me  +  r,)e}   _  Tusy 

and  therefore  :  ~  S  (77-)  =  .s^"  (1  —  /Oj-/)- 

Or,  is  at  once  obtained  from  the  slope  of  the  line  m,  by  which  we  endeavour  to 
fix  the  relationship  of  the  organs  A  and  B.  Or,  again,  we  may  look  upon 
.s'ov/1  —  as  a  quantity  measuring  the  mean  divergence  of  the  B  organ  from  that 
absolute  fulfilment  of  the  relationship  between  A  and  B  which  we  are  striving  to 
attain.  Thus  p^^  is  a  quantity  which  naturally  arises  in  every  attempt,  whether 
artificial  or  natural,  to  select  organs  having  a  definite  relationship  to  each  other. 

Much  the  same  considerations  arise  when  we  select  three  or  more  oi"gans.  In  each 
case  the  selected  coefficients  of  correlation  are  constants  which  enable  us  to  express 
(i.)  to  a  first  approximation  the  form  of  relationship  we  are  aiming  at,  and  (ii.)  the 
average  degree  of  divergence  from  absolute  fulfilment  of  this  relationship. 

Thus,  without  regard  to  any  particular  distribution  of  frequency,  the  s's  and  the 
p's  are  the  appropriate  constants  to  express  approximately  the  nature  of  any  form  of 
natural  or  artificial  selection. 


(11.)  On  the  FrohahiUty  of  Survival  and  the  Surface  of  Survival  Rates. 

■  In  the  course  of  the  present  paper  I  have  assumed  that  when  measurements  are 
made  on  any  population  for  a  complex  of  n  organs,  the  frequency  surface  may  be 
taken  as  approximately  normal.  If  this  holds  for  the  population  before  and  after 
selection,  and  measurements  made  on  many  groups  at  difterent  periods  of  life  seem 
to  indicate  that  it  is  approxiniately  true,  it  follows  that  we  can  determine  the  form 
of  the  probability  of  survival  as  a  function  of  the  means,  variations,  and  correlations 
of  the  selected  and  unselected  populations. 
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Let  the  unselected  population  be  given  by 

Z  =  Zq  expt.  —  1 1  CiiXf  +  t-o.jx/  +  .  .  .  +  c„„x"- 

+  2c^,x^x..-\-  .  .  .  +  2c'„_i,„x„_iX„}  .    .    .  (Ixxxvi.). 

Let  the  probabihty  of  survival  be  given  by 

p  =p^/{x^  —  k^,    x.  —  h,    x.^  —  /v-3,  .  .  .  x„,  —  k,)     .    .  (Ixxxvii.), 

where  /  is  at  present  an  unknown  function,  which  is  to  be  a  maximum  for 

X^         Jc^,      X.2  —    ^^3)  •  •  •  '^;»  —  ^'«) 

and,  if  the  selection  be  at  all  stringent,  to  take  rapidly  decreasing  values  as 

X-^  —  JCj,      Xo  '  "^S        ^3'  •  •  •  ^" 

take  increasing  large  negative  or  positive  values.  It  will  be  clear  then  that  the 
individuals  who  are  "  fittest  to  survive,"  i.e.,  have  the  smallest  death-rate,  are  those 
whose  organs  are  defined  by  :  - 

—  —  ^'2'  *  *  '  — 

and  fitness  generally  will  be  measured  by  the  closeness  of  the  individual  to  these 
"  fittest"  individuals. 

In  order  to  find  the  surface  of  survivors,  immediately  after  the  selection  if  growth 
be  taking  place,*  or  at  any  later  stage  if  growth  have  ceased,  we  have  only  to  multiply 
Z  by  p,  or : 

z  =  7i  X  ]p     .  (Ixxxviii.), 

is  what  in  the  earlier  part  of  this  memoir  I  have  termed  the  selection  surface.  Now 
if  this  selection  surface  be  itself  normal,  it  will  be  of  the  form  : 

2  =  Zq  expt.  -  4       (a^i  -  h^Y  +      {x^  -  h^f  +  .  .  . 

+      {x„  —  h„f  +  2613  {x^  —  b^)  {x^  —  62) 

—  .  .  .  +  25„_i,^(a;„_i  —  h„_y)  {x^  —  h,)     .    .  (Ixxxix.). 

Here,  as  m  the  value  of  Z,  all  the  constants  h^^,  h^^,  •  •  •  b„,„  b^^  .  .  .  b„_i  „  are  known 
in  terms  of  the  variations  and  correlations.  If  there  be  selection  of  q  organs  only 
out  of  the  n,  then  6^+1,^+1  .  .  .  &2,!z+i5  •  •  •  will  all  be  zero.    Since  by 

Equation  (Ixxxviii.)  _p  =  2^/Z,  it  follows  that  the  function  /which  defines  the  prob- 
ability of  survival  must  be  of  the  normal  exponential  type,  or 

*  I  propose  to  deal  in  another  memoir  with  the  important  problems  of  slow  selection  during  rapid 
growth,  and  of  secular  selection  during  several  generations. 
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/(•^i  —  ^'i'  -'^'i  ~  ^'c'  •  ■  ■      —  ^'n)  =  expt.  —  h       {x^  —  l^)'  -j-  ci..  {x.,  —  h.)'  +  .  .  . 

Thus,  to  determine  the  probabihty  of  survival,  we  require  to  know  the  values  of 
the  rt's  and  ^'s  in  terms  of  the  h's,  Jis,  and  cs.  The  shortest  method  of  finding  p,^  is 
to  put  x^  =  Jcy,  X.,  =  /o,  .  .  .  x„  =  l\„  and  then  note  that : 

_  z(^\  —  l\,  .v.,  =  I-,,  .  .  .  y„  =  /■„)  . 

Since  -  p  =  z/Z,    and    2;  =  2^/^'^, 

we  see  that  the  relations  for  p,  given  2:  and  Z,  and  for  2,  given  j:>  and  Z,  are  cyclicly 
interchangeable  if  at  the  same  time  we  change  to  —  c^,  c«  to  —  003 .  .  .  c^^  to 
—      .  .  .  C;,_i_„  to  —  If  (Tj,  cTo,  .  .  .  fr„  be  the  standard  deviations  of  the 

unselected  population,  this  amounts  to  changing  o-y,  cr^,,  ...  cr,,  to  a/  —  la^,  \/  —  lo-^, 
...  v/  —  lcr„  resjDectively.  Thus  the  results  which  give  the  probability  of  survival 
in  terms  of  unselected  and  selected  populations  can  always  by  an  easy  interchange 
be  used  to  obtain  the  selected  population  from  a  knowledge  of  the  miselected 
population  and  of  the  probability  of  survival. 

Let  the  unselected  population  be  defined  by  7)i^,  m.,,  .  .  .       cr^,  cr.:,  .  .  .  o-,„  and  r^^, 

Let  the  selected  population  be  defined  by  +  H^,  m.^  +  Hj,  .  .  .  m„  -f-  H,,  ti,  t.., 
'  ■  •  Sh?  and  Xi2i  tis)  103,  .  .  .  X,t-i,if 

Let  the  constants  of  the  probability  of  survival  function,  or  a^^,  a.,^,  .  .  .  a,,,,, 

«i3  .  •  •  be  expressed  by  s,,  s.^,  .  .  .  s„,  p^o,  p^g,  .  .  .  p„^i^„  as  if  it  were  a  normal 

correlation  surface.* 

Then  the  problem  will  be  solved,  if  we  know  the  /.-'s,  6''s,  and  ps  in  terms  of  the  cr's, 
r's,  H's,  'S's,  and  r's. 

Equating  the  squares,  products,  and  linear  terms  in  the  xs  in  the  equation 
p  =  ,i/Z,  we  have  at  once  the  system  : 


_  r  (xcii.), 

for  all  values  of  u  and  v  from  1  to  n. 

—  c(,Jci  —  cirjc.,  —  ...  —  ajc,  —  ...  —  arJc,, 

=  —  ?>,iHi  —  6,oH.,  —  ...  —  ?>,,H,  —  ...  —  h,„ll.    .    .  (xciii.), 
for  all  values  of  v  from  1  to  n. 

*  These  must  not  be  confused  with  the  .So,  .  .  .  s,j,  pi-j,  pis,  p-^s,  .  .  .  Pq,q-\  constants  of  the  c[  selected 
organs  of  the  previous  discussion,    The  new  quantities  may  be  in  part  imaginari/. 
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^  -  &^.^Hi  -  ?>,,H,  -  ...  -  /^a.H,  -  ...  -  ^.„H,  .    .  (xciv.). 


Now  let  A  be  the  determinant 


11 


^"31  -  ?^31'      ^3C  -  ^30>      ^33  -  ^^33' 


-  h 

c,,,  —  h,,, 


C-l't  —  ^-la 


and  A„„  the  minor  of  its  uv^^  constituent. 
Then  we  have  at  once  : 

A  X  h,  =  A,,  (6hHi  +  h,M,  +  .  .  .  +  h,M,) 
+  A,,(6,iHi  +  h^i  +  .  •  .  +  ^^.H^) 
+  A,3  +  63cH,  +  h,^, .  .  .  +  ^^..H,) 

+  A,„(/>,iHi  +  6,,Ho  +  .  .  .  +  ^.«H,). 

Or: 

A  X        =  Hi  (6iiA,i  +  6j3A,o  +  Z>i3A,;.  +  ■  .  •  +  KAn) 
+  Ho  (6.lA,i   +  6.0A,,  +  6.3A,3  +  .  .  .  +  h,An) 
+  H3  (ftgiA,!    +                   +  633A,,;j  +   .    .    .   +  6 
+  

+  H,  (6„iAn  +  6,,A,,  +  6„:;A,:3  +  .   .   .  +  h,„An)        •       •       •  (xCV.). 

Thus  is  determined  in  terms  of  H^,  Hn,  .  .  .  H;„  which  define  the  maximum 
frequency  of  survival. 

In  a  similar  manner  by  making  the  proper  changes  indicated  above  we  have : 

A'  X  H,  =  I;  {a.A'n  +  tti^A',.,  +  Cl.A'n  4-  .  •  .  +  ^vA'rn) 

4-  h  (aoiA',1  +  a-»A'v2  +  chz^'r:i  +  .  .  .  +  a.,A\.,) 

+        {a.,Xn  +  «32A',.3  +  «33A'«  +  .  .  .   +  «3«A',„) 

4  

+  k„  («„iA',i  +  a„oA',3  +  a„3A',;3  —  .  .  .  +  a„„A',„)  .    .    .  (xcvi.), 


where  A'  is  the  determinant 
VOL.  cc. — A. 


H 


50  PROFESSOR  K.  PEARSON  0\  THE  INFr>UENCE  OF  NATURAE 

(C21  +  «-2.),  +  «22),       (^'2.  +  «2::)  •   .   •   (C.2«  +  r',.) 


and  A'„„  the  minor  of  its  u  v'^'  constituent. 

Now  it  will  be  clear  from  these  results  that  as  a  general  rule  it  is  impossible  for  k„  to 
be  equal  to  H,..  In  other  words  :  2Vie  individiial  most  frequently  met  witli  in  any  given 
selected  community,  i.e.,  the  mediocre  individual,  is  not  the  individual  jittest  to  survive. 

It  is  only  in  the  limiting  case  of  natural  selection  being  so  stringent  that  one  type 
of  individual  alone  is  able  to  survive,  that  the  fittest  class  has  a  numerical  majority 
over  any  other  class  of  the  community.  This  seems  to  me  an  important,  algebraically 
almost  self-obvious  truth,  and  yet  one  which  is  very  much  obscured  by  the  use  of  such 
a  phrase  as  the  "  survival  of  the  fittest." 

Of  course,  if  there  be  continuous  selection,  or  an  environment  so  stable  that  the 
probability  of  survival  remains  constant  for  a  long  period,  there  will  be  a  gradual 
approach,  never  theoretically  an  actual  identification  of  the  mediocre  and  the  fittest. 
But  in  actual  nature  the  environment,  at  any  rate  so  far  as  it  depends  on  climato- 
logical  conditions,  must  have  a  long  period  as  compared  with  the  vital  and  reproductive 
periods  of  innumerable  forms  of  life.  A  hard  winter,  a  drought,  a  flood,  a  famine, 
a  plague  or  epidemic  of  any  kind,  even  if  fairly  stringent,  will  rarely,  if  ever,  render 
the  most  frequently  surviving  individual  identical  with  the  individual  who  is  fittest 
to  survive.*  Still  less  will  this  identity  take  place  in  the  many  processes  of  artificial 
selection,  which  are  becoming  and  will  more  and  more  become  valuable  laboratory 
aids  in  our  appreciation  of  the  action  of  natural  selection.  The  divergence  between 
the  most  frequently  surviving  and  the  fittest  individual  is  measured  by  the  above 
formulse  for  the  ^'s  in  terms  of  the  H's.t 

To  complete  the  solution,  the  a's  must  be  found  from  the  equations  of  type 

=  l>i,o  ~  (^irr,  and  then  from  the  «'s  the  .s's  and  ps  follow  by  the  well-known 
determinants  for  multiple  correlation  :  see  our  Equations  (xi.)  and  (xii.). 

Throughout  the  earlier  part  of  this  memoir  I  have  used  only  the  surface  of  selection, 
but  the  above  investigation  will  enable  us  whenever  desired  to  replace  it  by  the 
probability  of  survival.  I  will  illustrate  this  by  obtaining  the  formulae  suitable  to  the 
simpler  cases. 

*  We  badly  want  a  name  for  the  selection  which  acts  for  a  short  time  and  rapidly  modifies  the  adult 
population.  It  is  practically  the  type  of  selection  considered  in  this  paper.  It  is  epidemic  or  catastrophic 
in  character. 

t  The  point  is  of  considerable  importance,  for  more  than  one  influential  writer  has  spoken  of  the  result 
of  natural  selection  as  the  preservation  of  the  type  the  mortality  of  which  is  least  imder  the  given 
conditions. 
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(12.)  Case  {[.).— Selection  of  a  Single  Organ  only. 
The  original  population  is  given  by 


and  the  curve  of  survivors  by 
The  probability  of  survival  is 
where  we  easily  find,  if  S/cr  =  X, 


A;  =  H/(1  -  X~)  (xcvii.), 

7=s/v/r^-   (xcviii.), 

and  p,=  ^e-^^^^)   (xcx.). 


As  an  illustration  consider  a  selection  from  modern  French  peasants,  which  should 
reduce  the  mean  and  variability  of  their  cephalic  index  to  those  of  the  Libyan  race. 
French  peasants  : — 

m  =  79786,       o-  =  ;3-84L 

Libyans  : — 

m  +  H  =  72-938,       S  =  2-885. 
Hence:  H  =-  6-848,       X  =  -751 L. 

These  give  :  —  15-712,       6-  =  4-370. 

Thus  for  such  a  change  as  7  points  in  the  cephalic  index  to  take  place  by  selection* 
we  should  have  to  make  the  "fittest  to  survive"  of  such  a  ridiculously  low  cephalic 
index  as  64-074,  and  such  a  high  variation  as  4*370. 

We  find  =  51-0474  n/N, 

and  accordingly  the  probability  of  survival  given  by 

p  =  51-0474  ^e-<''  + 

where  N  are  the  number  of  Frenchmen  converted  into  n  Libyans  so  far  as  cephalic 
index  is  concerned. 

I  have  purposely  taken  a  somewhat  extreme  case  of  selection  in  order  to  illustrate 
how  widely  the  most  frequently  surviving  individual  can  diverge  from  the  fittest. 

In  this  case,  if  the  chances  of  survival  (i.)  of  the  fittest,  (ii.)  of  the  individuals  most 
frequent  after  selection,  and  (iii.)  of  the  individuals  most  frequent  before  selection,  be 
C^,  Cg,  and  Cg  respectively,  we  have  : 

*  This  is,  of  course,  supposing  the  change  to  occur  by  catastrophic  selection  and  not  by  a  continuous 
secular  selection,  see  footnote  preceding  page. 

H  2 
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:      :  Cg  ::  1  :  '12782  :  -00156, 

or,  the  chances  of  survival  of  an  individual  of  the  fittest  type  v^ould  be  about  eight  times 
as  great  as  those  of  an  individual  of  the  most  frequent  type  after  selection  and  about 
700  times  as  great  as  those  of  an  individual  of  the  most  frequent  type  before 
selection.  If  v^,  v.,,  v.^  be  the  numbers  after  selection  in  the  three  classes*  of  the 
fittest  to  survive,  the  most  frequent  after  selection  and  the  most  frequent  before 
selection,  we  find 

^1  :  I/.  :     :  :  '00892  :  1  :  '05978. 

In  other  words,  the  most  numerous  type  before  selection  is  still  after  selection  about 
6*7  times  as  numerous  as  the  type  with  the  least  mortality,  and  this  latter  type  is 
only  about  jjj  as  numerous  as  the  type  to  be  most  frequently  met  Avith  after 
selection  has  taken  place. 

Thus,  although  there  would  have  been  a  very  great  evolution  in  cephalic  index, 
due  to  a  fairly  stringent  selection,  the  fittest  to  survive  would  always  have  formed 
but  a  small  fraction  of  the  dominant  type.  Even  if  we  were  to  replace  the  selection 
liere  considered  by  a  gradual  evolution  spread  over  several  generations,  we  should 
still  reach  in  the  main  the  same  conclusion,  i.e.,  that  natural  selection  never  proceeds 
hij  the  survived  of  tJie  fittest,  or  the  survival  of  those  with  the  least  death-rate.  These 
will  always  remain  a  small  fraction  of  the  community — they  are  the  goal,  but  often 
the  very  distant  goal,  to  which  selection  tends  to  shift  the  population. 

(13.)  Case  (ii.) — Selection  of  Two  Orr/ans. 
In  this  case  let  the  surface  of  survivors  be  : 

the  original  population  : 

o-i-(i  -        o-jo-o  (1  -  n-:")  - '''ir)J 

and  the  curve  of  probability  of  survival ; 

Since  :  p>  =  ^/Z, 

*  By  individuals  of  a  type  or  class  is  meant  here,  as  elsewhere  in  this  section,  all  the  group  falling 
•within  some  small  definite  range  of  variation  Ij^ing  round  a  particular  value  of  the  organ  (e.g.,  m,  m  +  H, 
or  m  +  /•),  which  defines  the  type  oi'  class, 


SELECTION  ON  THE  VAlil ABILITY  AND  CORRELATION  OF  ORGANS.  53 
we  find  at  once 


1 


h'{l-py.f)          ^iHl-X,,')  CT'^ii-rJ)  ■ 

1  1  1  1 

1  Pli        _  ^13 


h'  (1  -  Pu')  ~  ¥3  (l"- Pi3^)  ~  2r  (1  -  x,J^)  (1  -  x,.^) 


•  (cni-)- 

.  (civ.). 

.  (cv,). 

.  (cvi.). 


¥3(1 -Pi3-)  '  V(l-Pi3-)         2iS,(l-ri,^)  '  2/(1 -rj,2) 
Let   S^/cTi  =  Xi,    2_,/o-o  =  X..   measure    the    stringency   of    the    selection,  and 


/X  =   ^^'^'^^^^'^       change  in  correlation.*  Then  solving  the  above  equations 


we  find  : 


(cviii.), 
(cix.), 


where 


v/r^ '^?'-'Xi2  -  \f  +  (1  -  x,^j\^^^2x^^{\,  •  ■  ■ 

 «   +  ~   ~"  R  ^  ~      ....  (CXU.), 

13=1  ~  r,.f  -      _      +  (1  _  r^.=)  X^3X/  +  2ri,r,.X,X.. 


Similarly,  if  the  original  population  and  the  curve  of  probability  of  surviving 
or  of  survival  rates  be  given,  we  have  to  find  the  selected  population  : 

pir,  +  I'l^V-K-^Ko  ,  .... 

tjo  =  — f--''  0  3   /I  ,    Q  '   (cxni.), 

*  If  ri2  =  COS  ifi2  =  COS  D,  /X  =  sin  D  /  sin  d.  The  quantity  D  has  been  conveniently  termed  the 
"  divergence "  by  Mr.  Sheppard.  Hence  /x  is  the  ratio  of  the  sines  of  the  selected  and  unselected 
divergences.  The  above  formula  for  pyy,  can  be  at  once  changed  into  one  suitable  for  trigonometrical 
logarithmic  calculation.  Let  sin  aj  =  /xXj,  sin  ol=i  -=  /xAo,  and  ~py>  =  cos  S;  then,  if  A  be  the  side  of  the 
spherical  triangle,  of  which  a.^,  cco  are  the  other  sides  and  S  the  included  angle  : 


sin^^S^  ./^i^(^^)^^"HD  +  A)1 
VI,  cos  a-i  cos  J 
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where  v  =  A/T—^^a '    '^i  ~  ^^/^^    '^^^    '^^  ~  ^'^/^s. 


=  -^^Ak^^Ul^   .    ,  - .  (cxiv.), 


'I  "1 


7  ^2  J 


(cxv.), 


Hg    _         ?'l2«3"  —  Pi 3^1  ^2  _|_  ^^2    1  —  ^'12°  +  ^1^  ~  Pl2^'l2*^1^2  (cXVli  ) 

where  y  =  1  —       +      +  /^e'  +  (1  ~~        '<\Ki  —  ^^Pa^'i-z'^i'^i- 


(14.)  Illustration. — The  following  results  are  taken  from  tlie  paper  by  Miss  Alice 
Lee  and  myself  already  cited  : — 


Femur 
Humerus 


Correlation  of  ^ 
Femur  and  j- 
Humerus  J 


Fieiicli  J'. 

Aino 

.     .   ^^1  = 

45*228  centims. 

40-/70  centims. 

—  'Illy 

+  H. 

2-372  „ 

1-898 

=  s,. 

.  = 

33-010 

29-502 

=  m.^ 

(To  = 

1-538 

1-343 

=  23. 

■•     •  'u  = 

-8421  . 

•8584 

=  1*1^. 

As  indicated  by  the  symbols  above,  let  us  select  from  the  French  a  population 
ivinp-  the  same  femur  and  humerus  relations  as  the  Aino. 
We  have  at  once  : 

H,  =  —  3-508, 


Whence  we  find 


Hi- 


•9027, 


4-458, 
-80017,        \.2  = 
p?  =  -9047. 


-87321, 


Sy  =  3-4870, 


s,  =  2-8736, 


I'y  =  —  17-8464, 


k,  =  —  15-8547, 


Po  =  208,425  -  , 


where  n  are  the  number  of  Ainos  which  can  be  obtained  from  N  Frenchmen. 
We  have  accordingly  the  following  form  for  the  surface  of  fitness  to  survive 
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p  =  208,425  ^  expt.  -  l{-222056  (:r  +  17-8464)'-  +  -326,960  {y  +  15-8547)2 

—  -486,450  {x  +  17-8464)      +  1  5-8547)  ^ 

Now  it  is  clear  that  if  we  wanted  by  a  "catastrophic"  selection  to  convert  the 
French  into  something  resembling  the  Aino,  we  should  have  to  give  the  least  death- 
rate  to  those  French  with  femur  corresponding  to  +  and  humerus  to  m.  +  k., 
or  to  the  dwarfs  with  femur  =  27*382  centims.  and  humerus  =  17*155  centims.  ! 
By  no  other  means  could  we  shift  the  modal  value  of  the  French  population  down  as 
low  as  the  Aino  modal  value.  The  physical  meaning  of  this  is  that  we  have  been 
compelled  to  put  on  an  excessive  death-rate  for  the  bigger  Frenchmen. 

An  interesting  point  of  our  work  is  that 

=  1-2514  Hi  -f-  3-4971  Ho, 
=-  1-0266  Hi  +  5-8242  Ho, 

whence  we  see  that  while  a  selective  reduction  of  humerus  is  far  more  effective  in 
reducing  both  femur  and  humerus  centres  of  survival  than  a  reduction  of  femur,  a 
selective  reduetion  of  femur  occurring  contemporaneously  with  that  of  the  humerus 
actually  tends  to  raise  the  centre  of  the  humerus,  i.e.,  the  coefficient  of  H^  is 
negative. 

Now  let  us  consider  the  frequency  of  survivors  per  unit  length,  say  centimetre  of 
femur  and  humerus,  at  different  points.  The  surface  of  survivors,  i.e.,  the  Aino 
population,  is 

_  1       1       5  (X  -  H,-)2    2ri,  {X  -  HQ  (y  -  H^)     (y  -  H^P  ) 

"  27r2iSov/(l-ri2') 

If  we  put  X  =  0,  y  =  0  we  have  the  frequency  after  selection  of  the  original 
population  type  ;  if  we  put  x  =  H^,  ?/  =  Ho  we  have  the  frequency  after  selection  of 
the  new  population  type  ;  and  if  we  put  x  =  k^,  y  =  ko,  we  shall  have  the  frequency 
after  selection  of  those  best  fitted  to  survive.  If  these  frequencies  be  Vg,  v.^,  Vy 
respectively,  we  find  on  substituting  the  numerical  values  that 

:  z/o  :  ^3  ::  -117/10^^  :  1:  -032289. 

Thus  the  most  frequent  type  of  the  new  population  is  now  about  thirty  times  as 
frequent  as  the  old  most  frequent  type,  while  the  type  most  fitted  to  survive  has 
practically  no  existence  at  all.  It  probably  lies  outside  the  actual  boundary  of  the 
French  population. 

Here  really  arises  the  question  as  to  how  we  are,  in  any  actual  problem,  to  fix  the 
ratio  of  n  to  N,  or,  what  amounts  to  the  same  thing,  to  fix  a  practical  boundary  to  a 
given  population.    Such  a  boundary  must  be  conventional,  but  I  think  that  for 
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practical  purposes  we  are  quite  safe  if  we  assume  that  an  individual  who  occurs  only 
once  per  thousand  can  produce  no  effect  on  the  physical  evolution  of  the  population 
as  a  whole. 

Now  the  form  of  a  correlation-surface  for  two  organs,  x  and  ij,  is 

— -      g    "  1  —      i  <ri'     o-i<r.j    cr/  ' 

27r<7iO-o  v/ 1  —  r- 

Let  us  write  k'  =  :;  „  \      ~  ~ — '  +  ~  r  !  then  k  —  a  constant  ffives  a  series  of 

1  —  r-   [o-j-  CTjO-^         (To- J  ° 

similar  ellipses  which  are  the  contour  lines  of  the  surface,  or  lines  of  equal  frequency, 
i.e.,  giving  individuals  with  equal  probability  of  occurrence.  Let  the  equation  to 
these  contour  lines  referred  to  their  principal  axes  be 

"  ~      +  B2  • 

Then  we  have  at  once  : 

10       I  t) 


I  _  _   I  =  — ^_  ii__L^2 

^  B2        O-i^  (1  -  9-3)  ^         (■!  _  ,.2)         1  _  ,.2      ^2^^  ' 


A2B2       cTiV.^  (1  -  r-f       a^^a.^  (1  -  r^f       (1  -  9-)  o-iV.s 

or,  AB  =  a-.cr,  x/T^^\       A:~  •^B'  =  a,'  +  a-A 

Further,  if  </>  be  the  angle  the  A  principal  axis  makes  with  the  axis  of  x,  we 
have  :* 

tan  2(f)  —  2r(T^(r.2l{(j^  "~  ^i)- 

These  fully  determine  the  principal  axes  of  the  frequency  surface.  Now  consider 
the  frequency  between  the  elliptic  cylinders  corresponding  to  k  and  k  -\-  ^k;  we 
have  it 

=  z  X  ^n-AB/ccZ/c  =  s  X  2770-10-.  \/l  —     k^/k  =  N('"''''kc/k-. 
Hence,  if  N^  be  the  frequency  outside  any  contour  k, 

N,  =  N     e--'\dK  =  Ne-'«^^  (cxviii.). 

For  N^  to  be  jo-oo  of  N  we  have  simply 

—   ,  whence  k  —  3-716,923. 

log  e 

*  For  ecasy  calculation  put  y  =  n/o-j^  +  o-g^,  tan  t/'  =  0-2/0-1.    Then  we  have  at  once  if  r  =  cos  D : 

A  =  7  cos  X)       B  =  y  sin  x, 
^^^^^^  =  sin  2x  =  sin  2^  sin  D,       tan  2<t>  =  tan  2^  cos  D. 
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This  will  enable  us  to  determine  our  conventional  boundary  to  effective  population. 
Now  let  us  refer  our  non-selected  and  selected  populations  to  their  centres  and 
principal  axes. 

We  find  for  the  contour  curves  : — 


Unseleeted  Population 
{i.e.,  French). 

Selected  Population 
(i.e.,  Aino). 

Surface  of  Survival 
{i.e.,  Rate  of  Survival). 

centre  (femur)  .... 
//  centre  (humerus)    .  . 
tan  0  (slope  to  ,r)  .... 

Principal  axes  |^  g 

1  in  1000  limit  l^g-    '  ' 

45-228 
33-010 
•601,3775 
2-7338 
•7197 
10-1615 
2^6750 

40-770 
29-502 
-670,1454 
2-2514 
•5808 
8 •3682 
2^1588 

27-382 
17-155 

-807,3371 
4-4115 

•9775 

Referred  to  its  principal  axes,  the  rate  of  survival  is  now 

p  =  208, 


425  —  e    '-1{i-iU5f  (-9775)2* 


Suppose  we  require  to  get  at  least  1000  Aino  out  of  the  French  population,  N,  then 
n  =  1000.  Now  suppose  the  Aino  limiting  ellipse  drawn,  then  the  French  population 
must  be  sufficiently  large  to  give  the  individuals  inside  this  ellipse.  Now  p  gets 
smaller  as  we  go  further  from  the  centre  of  the  survival  surface.  Hence  the  contour 
line  of  the  survival  surface  corresponding  to  p  =  I  must  be  touched  externally  by  the 
limiting  contour  of  the  Aino  population,  in  order  that  we  may  get  at  least  1000  Aino 
out  of  N  Frenchmen.  Now,  by  a  graphical  construction,  I  find  the  major  axis  of  the 
elliptic  contour  line  of  the  survival  surface  which  touches  the  Aino  limiting  ellipse,  is 
about  11-44.  This  gives  for  the  parameter  of  this  ellipse,  k-j  X  4-1149  =  11-44,  or 
Ki  =  2-5932.  Whence: 


p.,=  l=  208,425 -^^°^e- 


gives  the  greatest  possible  value  of  p  and  the  least  possible  value  of  N.  Numerically 
this  gives  us  N  =  7,200,000  about,  or  we  should  want  more  than  7,000,000  of 
Frenchmen  to  obtain  our  1000  Aino  by  a  catastrophic  selection.  The  actual  bounding 
contour  line  of  this  least  possible  number  of  Frenchmen''^'  has  for  its  major  axis 
15-285  centims.,  and  it  touches  the  Aino  limiting  ellipse  at  the  point  where  it  is 
touched  by  the  survival  contour     =  1. 

Now  let  us  turn  the  problem  round  and  ask  what  is  tlie  least  population  of  Aino 

*  The  least  possible  to  reproduce  the  Aino,  as  far  as  femur  and  humerus  are  concerned,  to  1  in  a  1000 
of  the  population. 
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from  which  we  could  produce  1000  Frenchmen  by  a  catastrophic  selection.  In  this 
case  the  surface  of  survivals  is  simply  obtained  by  inverting  p,  and,  if  N'  be  the 
number  of  Aino,  n  =  1000  =  required  number  of  Frenclimen,  we  have  : 


'})    =    -  ft*  1(4-4115)^     (  9775)2/ , 

^        208,425  N 


Here  p'  gets  larger  as  we  go  away  from  the  centre  of  the  surface  of  survivals, 
and  we  must  therefore  make  the  French  limiting  ellipse  just  touched  internally  by 
tlie  contour  line  of  the  surface  of  survivors  for  which  p'  =  1.  The  major  axis  of  this 
contour  line  for  p'  =  1  was  found  by  a  graphical  process  to  l)e  about  33"91.  This 


gives  for  k.. 


fc.  X  (4-411  5)  =  33-91,     or     k.  =  7-G8G7. 


Thus  •  W   =  I  =  -12^  i_  ^wmir 

inus.  208,425  N' 

leads  to  N'  =  32,460,000,000  about,  or  we  sliould  want  upwards  of  32,460,000,000  of 
Aino  to  produce  the  1000  Frenchmen.  The  bounding  contour  line  of  this  number  of 
Aino  has  a  major  axis  of  15 '890  centims.  about,  and  touches  the  French  limiting 
contour  in  the  point  in  whicli  it  is  touched  by  the  j^'  —  1  contour  of  the  surface  of 
survivals. 

Now  the  difference  between  these  two  unselected  populations  is  very  great.  We 
see  that  to  get  the  Aino  a  very  great  niunber  of  Frenclnnen  would  have  to  be 
exterminated,  about  7000  for  each  Aino  selected ;  but  to  get  the  Frenchmen  from 
the  Aino  an  appalling  nmnber  of  Aino  would  have  to  be  destroyed,  upwards  of 
32,000,000  for  each  Frenchman  selected.  Even  if  the  selection  were  not  catastrophic 
but  spread  over  centuries  and  centuries,  we  must  recognise  what  a  large  consumption 
of  life  there  must  be — individuals  destroyed  without  progeny*^ — if  we  are  to  supjDose 
any  highly  civilized  race  like  the  French  produced  by  selection  from  an  apparently 
primitive  type  like  tlie  Aino.  Indeed,  the  return  journey  in  this  case  seems  much 
easier  than  the  upward  ascent.  Beyond  all  this  we  have  only  made  French  and 
Aino  alike  for  two  organs,  and  only  for  one  character  of  each  of  them  !  Allowing  for 
our  conventional  limit  to  the  population,  allowing  for  the  fact  that  our  Aino  data  are 
drawn  from  a  very  limited  population  of  remarkably  small  variability,  it  seems  very 
improbable  that  the  French  have  ever  been  produced  by  selection  from  a  primitive 
race  at  all  resembling  the  Aino.  The  fact  that  the  Aino  could  be  so  much  more 
easily  obtained  by  selection  from  the  French  seems  to  indicate  that  they  are  rather 

*  Of  course,  with  a  secular  selection  spread  over  many  generations,  it  is  largely  the  potentiality  and  not 
the  actuality  of  life  which  is  destroyed.  Still,  while  the  gross  number  killed  among  a  small  primitive 
community  may  not  be  large,  the  death-rate  must  still  be  immense.  I  hope  to  return  to  these  points 
when  dealing  with  secular  selection  as  distinguished  from  catastrophic  selection. 


SELECTION  ON  THE  VARIABILITY  AND  COERELATION  OF  ORGANS.  59 

some  degenerate  offshoot  of  a  race  superior  to  themselves  than  a  sample  of  the 
primitive  people  from  which  the  Circassian  races  may  be  supposed  to  have  sprung. 

The  whole  of  this  discussion  is,  of  course,  very  hypothetical ;  no  stress  whatever  is 
to  be  laid  upon  it  except  as  an  illustration  of  method,  and  a  rough  appreciation  of  the 
vast  amount  of  elimination  which  must  be  necessary  to  evolve  one  race  from  a  second 
in  the  case  of  organs  which  we  know  by  measurement  to  have  continuity  of  variation, 
and  only  saltatoiy  changes  in  pathological  cases,  which  have,  as  far  as  we  can  judge, 
no  influence  on  the  mass-evolution  which  has  produced  the  local  races  of  man. 

But  given  fair  samples  of  material  our  method  will  enable  us  to  determine  whether 

a  race  A  for  of  course  a  limited  number  of  characters— could  with  less  destruction  be 

deduced  from  a  race  B,  than  the  race  B  from  A.  It  will  not  therefore  follow  that  the 
path  of  least  selection  is  that  which  necessarily  was  used  by  Nature.  Possibly  both 
A  and  B  have  been  reached  by  far  less  expenditure  of  material  from  C.  Still  it  is 
something  definite  in  the  midst  of  our  gropings  after  truth  in  problems  of  descent  to 
have  even  a  rough  appreciation  of  the  amount  of  selective  destruction  which  would 
arise  from  alternative  suggestions.  That  is  why  this  special  numerical  illustration  of 
the  surface  of  survival  has  been  given. 

The  reader  will  possibly  find  the  matter  rendered  somewhat  clearer  by  the  diagram. 
The  femur  is  measured  along  the  horizontal  and  the  humerus  along  the  vertical. 
A  is  the  type  or  mean  femur-humerus  of  the  Aino  population.  Within  in  the 
continuous  ellipse  round  A  the  whole  Aino  population  up  to  1  in  1000  would  fall. 
F  is  the  type  of  the  French  population  and  the  continuous  ellipse  round  F  gives  the 
area  within  which  up  to  1  in  1000  of  the  French  population  fall.  Since  the  diagram  is 
drawn  to  centimetres  of  the  bones,  it  will  be  seen  how  very  small  are  the  limits  of 
variation  within  both  poj)ulations.  P  is  the  centre  of  the  surface  of  survivals  ;  for  the 
selection  of  Aino  from  French  it  makes  the  "  fittest  to  survive."  In  the  case  of  the 
selection  of  the  French  from  the  Aino,  P  is  no  longer  the  centre  of  fitness,  but  the 
"  centre  of  unfitness  "  ;  the  Aino  are  killed  oft'  with  an  intensity  which  increases  the 
closer  we  approach  to  P.  Now  it  seems  to  me  that  these  two  cases,  which  are  quite 
distinct  in  theory,  ought  to  manifest  themselves  in  Nature  and  require  distinguishing 
names.  A  race  may  be  modified  because  a  complex  of  organs  with  a  certain  system 
of  values  is  good  for  it,  or  because  it  is  bad  for  it.  The  race  may  be  modified  l^ecause 
a  certain  element  of  it  is  fittest  or  because  it  is  unfittest  to  survive.  In  the  former 
case  we  select  for  survival  round  the  centre,  in  the  latter  case  we  select  for  destruction. 
I  propose  to  call  these  cases  positive  and  negative  selection  respectively.  It  may  be  said 
that  if  there  be  positive  selection  in  one  part  of  the  population  there  will  be  negative 
in  another.  But  the  kernel  of  the  matter  is  in  either  case  the  existence  of  a  centre, 
a  definite  set  of  most  fit  or  of  most  unfit  organs,  while  in  positive  selection  the  less  fit 
organs,  and  in  negative  selection  the  more  fit  organs  are  distributed  over  wide  areas 
of  the  field,  and  do  not  i^each  a  maximum  of  unfitness  or  a  maximum  of  fitness 
respectively  for  any  definite  individual. 
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In  the  diagram  we  have  also  drawn  the  contour  line  to  wliich  the  French  population 
must  extend  if  we  are  to  get  at  least  a  representative  population  of  1000  Aino  from 
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it,  and  further  the  contour  line  to  which  the  Aino  population  must  extend  if  we  are 
to  get  at  least  a  representative  population  of  1000  French  from  it.    A  consideration  of 


SELECTION  ON  THE  VARIABILITY  AND  COERELATION  OF  ORGANS. 


61 


the  nature  of  the  contour  lines  of  the  surface  of  survivals  shows  that  the  contour  lines 
ahove  referred  to,  and  marked  "boundary"  in  the  diagram,  must  touch  the  Aino  1  in  1000 
limit  and  the  French  1  in  1000  limit  respectively  at  the  points  in  which  they  are 
touched  by  tlie  contour  lines  p  =  1  and  =  1  of  the  corresponding  surfaces  of 
survivals.  I  liave  already  indicated  that  the  major  axes  of  these  boundaries  are 
15-285  for  the  French  and  15-890  for  the  Aino.  The  corresponding  values  of  the 
parameter  k  are  respectively  given  by 

/15-2SrV\  15-890       ^  r.^^n 

Hence  by  (cxviii.)  we  can  easily  find  the  frequency  of  population  outside  the 
contours  k^,  and  Ka  ;  if  these  be  t-j,-  and      we  have  : 

I.,.  =  -000,000,163,  =  •000,000,000,015. 

Thus  the  Frencli  population  would  liave  to  be  extended  to  a  boundary  in  which  only 
about  1  in  six  millions  was  excluded,  and  the  Aino  population  to  a  boundary  excluding 
only  1 5  in  the  billion !  The  boundaries  of  what  we  may  thus  term  the  selection 
populations  are  far  larger  than  our  conventional  boundaries  of  1  in  1000  for  represen- 
tative populations.  In  fact,  it  would  be  impossible  to  select  a  representative  Aino 
jaopulation  from  a  conventional  representative  French  population  and  vice  versa — in 
either  case  the  very  exceptional  members  of  French  or  Aino  populations  are 
required  to  complete  the  conventional  representative  populations  of  Aino  or  French 
by  selection. 

(15.)  I  have  devoted  most  of  my  consideration  of  the  surface  of  survivals  to 
a  particular  case  in  which  two  organs  have  been  selected,  and  we  consider  the  nature 
of  p  which  determines  the  fraction  of  each  group  of  individuals  which  survives. 
I  have  done  this  partly  because  normal  surfaces  ai-e  at  best  only  an  approximate 
representation  of  our  selectable  and  selected  distributions,  and  partly  because  I  have 
thought  a  concrete  case  would  best  bring  out  the  general  points  of  investigations  ol 
this  kind. 

But  some  little  indication  of  the  properties  of  the  surface  of  survival-rates  ought 
to  be  indicated  here,  or  it  may  appear  that  they  have  been  overlooked.  While  the 
contour  lines  of  the  correlation  frequency  surfaces  for  two  organs  must  be  ellipses,  this 
does  not  follow  in  the  case  of  the  surface  of  survival-rates.  In  our  illustration  they 
were  ellipses,  but  they  may  be  also  parabolas,  hyperbolas,  or  even  straight  lines. 
We  must  not  therefore  expect  to  find  always  a  "  centre "  of  positive  or  negative 
selection.  We  may  come  across  a  "  saddle-back  system  "  of  contours  with  the  rate 
of  survival  constant  along  two  intersecting  lines,  but  rising  in  one  pair  of  opposite 
angles  and  falling  in  the  other  pair.  In  this  case  we  have  fields  of  negative  and 
positive  selection  separated  by  two  independent  relations  between  the  two  organs. 
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which  are  hnear  and  for  which  the  survival-rate  is  the  same,  they  may  he  termed  the 
"  critical  hnes."  For  one  pair  of  angles  the  centre  is  now  a  "  centre  of  fitness,"  for 
the  other  pair  of  angles  a  "  centre  of  unfitness."  It  seems  to  me  that  these  critical 
organic  relations  may  possess  considerahle  biological  importance. 

If  the  contour  lines  of  the  surface  of  survival-rates  are  parabolas,  we  have  really 
only  a  limiting  case  of  the  centre  at  a  very  great  distance.  It  is  one  in  which  the 
fittest  (or  most  unfit)  has  no  jjractical  existence,  hut  there  is  a  direction  towards 
which  the  rate  of  survival  will  be  found  to  be  always  increasing  or  decreasing. 

If  the  contour  lines  of  the  surface  of  survival-rates  are  parallel  straight  lines,  then 
so  long  as  the  deviation  in  one  organ  has  a  certain  definite  relation  to  that  in  the 
other,  the  survival-rate  will  remain  constant.  In  this  case  the  survival-rate  will 
fall  uniformly  in  one  direction  and  remain  constant  in  the  direction  at  right  angles 
to  it. 

All  the  cases  I  have  given  here  can  occur  j\ist  as  easily  as  the  elliptic  contour 
system  of  our  illustration  and  diagram.  Each  is  marked  by  quite  definite  l>iological 
characteristics,  and  we  may,  perliaps,  class  them  as  elliptic,  hyperbolic,  parabolic, 
and  linear  selection.  Even  if  the  surface  of  survival-rates  be  not  of  the  exponential 
quadric  type  discussed  in  this  paper,  yet  to  the  neiglibourhood  of  each  part  of  it 
this  classification  of  selection  types  will  apply. 

If  we  pass  to  moi'e  than  two  organs,  then  similar  considerations  will  apply  ;  we 
shall  only  be  reproducing  the  geometry  of  quadric  surfaces  in  space  of  three  and 
higher  dimensions.  But  before  we  allow  ourselves  excursions  into  the  liii^her 
geometry  of  the  surface  of  survival-rates,  it  seems  desirable  that  we  should  obtain 
quantitative  determinations  of  this  surface  by  experiments  in  artificial  selection. 
We  shall  then  be  better  able  to  see  what  part  of  our  geometry  will  really  be  of 
service  for  the  ^^roblems  of  natural  selection.  The  field  is  too  large  to  be  cultivated 
for  merely  theoretical  interests.  We  must  first  determine  what  parts  of  it  are  likely 
to  have  practical  application  to  life  as  we  find  it,  but  of  death-rates  in  the  case  of 
any  living  form  but  man,  we  are  at  present  sadly  ignorant. 


SP]LECTION  ON  THE  VAlilABILITY  AND  CORRELATION  OP  ORGANS. 


Key  to  Selective  Correlation  Tables. 


Case. 

Values  of 

ri2. 

(") 

0 

0 

(b) 

0 

•25 

(0) 

0 

•5 

(d) 

0 

■75 

{'') 

0 

1 

if) 

•25 

•25 

(.'/) 

•25 

•5 

(/') 

•25 

•75 

(i) 

•25 

1 

(./) 

•5 

•5 

(k) 

•5 

■75 

{/) 

•5 

1 

('") 

•75 

•75 

(n) 

•  75 

1 

iP) 

1 

1 

Formula  : 

R'  =  R  +  10  (A,R)      +  4  [(A,^R)  Br,,  +  (A,^R)  Br,,  +  (A,.^R)  Br,,]. 
Occasionally  second  differences  must  be  used. 
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Generalised  Theory  of  Alternatioe  Inheritance,  with  special  Reference  to 
Mkndel's  Laics. 

By  Karl  Pearson,  F.R.S. 

Received  September  11,— Read  November  26,  1903. 

(1.)  Introductovjj.    On  a  Generalised  Theorij  of  Alternative  Inheritance,  ivith  sjjecial 

Reference  to  Mendel's  laivs* 

It  seemis  likely  to  be  of  interest  at  the  present  time  to  consider  rather  at  length  a 
fairly  full  mathematical  theory  of  the  pure  gamete.  We  do  not  venture  to  call  this 
theory  a  generalised  Mendelian  theory  of  inheritance,  partly  because  it  is  not  even  the 
most  general  theory  of  the  pure  gamete  conceiva1)le,  partly  because  Mendel's  original 
theory  of  heredity  was  perfectly  clear  and  perfectly  simple,  and  is  not  the  theory  here 
developed.  The  pure  and  simple  Mendelian  theory  seems  to  have  been  discarded  in 
the  light  of  recent  experimental  results  by  more  than  one  Mendelian,  both  in  this 
country  and  abroad.  The  original  Mendelian  theory  has  been  replaced  by  what  are 
termed  "  Mendelian  Principles."  In  this  aspect  of  investigation  the  fundamental 
principles  propounded  Ijy  Mendel  are  given  up,  and  for  each  individual  case  a  pure 
gamete  formula  of  one  kind  or  another  is  suggested  as  describing  the  facts,  t  This 
formula  is  then  emphasised,  modified  or  discarded,  according  as  it  fits  well,  Imdly,  or 
not  at  all  with  the  growing  mass  of  experimental  data. 

It  is  quite  clear  that  it  is  impossible  while  this  process  is  going  on  to  term 
anything  whatever  Mendelian  as  for  as  theory  is  concerned.  The  present  investigation 
is  therefore  not  a  generalised  Mendelian  theory  of  heredity  :  we  speak  of  it  merely 
as  a  generalised  theory  of  alternative  inheritance,  and  it  is  based  on  the  conception 
that  the  gamete  remains  pure,  and  that  the  gametes  of  two  groups,  while  they  may 
link  up  to  form  a  complete  zygote,  do  not  thereby  absolutely  fuse  and  lose  their 

*  I  owe  the  incentive  to  this  memoir  to  Professor  W.  F.  R.  Wi':ldon,  who  had  already  worked  at  some 
of  the  simpler  special  cases  and  who  placed  his  results  entirely  at  my  disposal. 

t  See  especially  Tschermak,  '  Zeitschrift  f.  d.  landwirthsch.  Versuchswesen,'  Jahrg.  IV.  ("  Ueber 
Ziichtmig  neuer  Getreiderassen ") ;  De  Vries,  '  Ber.  d.  deutsch.  botan.  Gesellsch.,'  vol.  xviii.  (1900), 
pp.  435-443 ;  Bateson,  '  Proc.  Camb.  Phil.  Soc.,'  vol.  12,  p.  53  ;  '  Nature,'  April  and  May,  1903. 
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identity.  The  analytical  expression  of  this  is  represented  l)y  the  fundamental 
formula  : 

(AA')  X  {aa')  ^  =  [  Ka 
=  (A  +  A')  («  +  a)  I        1  Aa' 


j        ^  k'a' 

where  (AA')  and  {aa')  are  the  parental  zygotes,  and  the  right-hand  side  of  the 
equation  represents  the  four  230ssih]e  constitutions  of  tlie  offspring.  Such  a  formula 
as  the  above  may  be  accepted  without  any  hypothesis  as  to  dominant  and  recessive 
characters,  but  these  terms  were  certainly  essential  to  Mendelian  theory  as  propounded 
by  Mendel  himself,  and  it  becomes  very  doubtful  whether  ^va  ought  to  attach  his 
name  to  any  theory  whicli  discards  these  "  recognition  marks."  It  is  very  convenient, 
however,  to  have  names  for  the  alternative  elements  expressed  by  capital  and  small 
letters  respectively.  I  propose  for  the  purpose  of  this  paper  to  term  an  A-element  a 
protugene,  and  an  a-element  an  allogene.  Two  protogenic  elements  will  give  rise  to  a 
protogenic  zygote  AA,  two  allogenic  elements  to  an  allogenic  zygote  aa,  and  a 
protogenic  and  allogenic  element  to  what  Mr.  Batesox  has  termed  a  heterozygote 
A«.  We  may  thus  class  his  homozygotes  into  protozygotes  and  allozygotes.  We 
reach  pui  e  Mendelianism  by  making  our  protozygotes  "  dominants,"  our  allozygotes, 
"  recessives,"  and  our  heterozygotes  "  hybrids  of  dominant  character."  In  so  far  as 
our  theory  of  pure  gametes  rej)laces  protozygote,  allozygote,  and  heterozygote  by 
'■  dominant,"  "  recessive,"  and  "  hybrid  with  dominant  character,"  it  becomes  a 
generalised  Mendelian  theory,  but  only  in  this  case.  Otherwise  we  must  look  upon 
it  as  an  attempt — in  one  direction  only  of  course — to  give  a  consistent  mathematical 
basis  to  the  various  formulae  which  have  been  propounded  for  describing  statistical 
data  classed  under  Mendelian  categories ;  shortly  we  shall  endeavour  to  develop  a 
general  pure  gamete  theory. 

The  results  were  worked  out  in  a  purely  impartial  frame  of  mind  ;  indeed,  once 
state  the  hypotheses,  and  the  analysis  is  far  too  complex  to  allow  us  to  predict^a 
jjriori  what  can  possibly  result  from  it,  nor  does  the  investigation  admit  of  any  but 
one  solution.  If  the  hypotheses  are  admissible,  then  any  narrower  pure  gamete 
formula  must  lead  to  results  embraced  under  our  general  conclusions. 

What  we  have  to  admit  at  the  present  time  are  the  following  conditions  : — 

(i.)  The  existence  of  a  vast  bulk  of  evidence  that  heredity,  as  far  as  measurable 
characters  are  concerned,  follows  within  a  population  perfectly  definite  laws. 

(ii.)  The  existence  of  another  mass  of  experiments,  in  which  simple  and  pure 
Mendelianism  is  certainly  inadmissible,  but  in  which  certain  ratios  undoubtedly 
approach  the  values  they  would  have  on  such  a  simple  and  pure  Mendelian  theory. 

It  is  possible,  therefore,  that  a  generalised  theory  of  the  pure  gamete  would  account 


INHERITANCE,  WITH  SPECIAL  REFERENCE  TO  MENDEL'S  LAWS. 


55 


tor  (ii.) ;  it  can  only  do  so  satisfactorily,  however,  if  it  does  not  contradict  the  results 
of  (i.).  Hence  arises  the  present  attempt  to  develop  in  one  direction  a  generalised 
theory  of  the  action  of  the  germ-cell. 

As  we  have  frequently  had  to  assert,  the  laws  investigated  under  (i.)  have  nothing 
whatever  to  do  with  any  physiological  hypothesis.  That  a  physiological  hypothesis 
leads  to  them  is  not  nuich  test  of  its  validity— it  is  a  necessary,  but  not  sufficient, 
criterion  of  its  correctness.  If,  however,  it  contradicts  them,  we  are  bound  to 
discard  it,  and  seek  for  its  modification  or  replacement.  The  present  study  is  an 
attempt  to  see  how  far  one  generalised  pure  gamete  theory  leads  to  results  in 
accordance  with  the  law  of  regression  and  the  known  nature  of  the  distributions  of 
otispring  in  jJopulations. 

(2.)  Nature  of  Hypothesis  adopted. 

We  start  with  a  zygote  consisting  not  of  a  single  protogenic  pair  AA,  but  built 
u])  of  11  such  pairs, 

AjA^  +  AoA.  +  A3  A3  +  .  .  .  +  A,  A,, 

We  suppose  this  to  produce  gametes  which  unite  with  those  of  a  similar  allogenic 
zvtrote 

a^a^  +  ttott^  +  <-h^z  +  •  •  •  +  a„a.ii. 

Any  element  of  the  protogenic  gamete  must  unite  with  the  corresponding  element 
of  the  allogenic  gamete,  i.e.,  A,,  with  a,-,  and  hy  the  fundamental  principle  (i.)  above, 
this  gives  rise  to  the  foin-  possibilities 

A,a,- , 
AfUf , 
o^;-A-/ , 
a,.Ar , 

wliich  are  all  of  the  same  constitution.   The  result  is  the  hybrid  group,  symbolised  by 

a^A^  +  a,A,  +  a-^A.^^  +  .  .  .  +  a,A„, 

the  perfect  multiple  heterozygote. 

The  population  will  now  be  supposed  to  consist  of  any  number  of  such  perfect 
heterozygotes,  which  we  shall  suppose  to  again  cross.    We  shall  now  have 

a,A,  X  a,A,.  =  ^-  a,.a,. 

\  a,A,. 
^  Aj-a,. 
lA,A,., 
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or,  each  couplet  will  give  rise  to  four  possibilities,  representing,  liowever,  only  three 
constitutional  differences,  expressed  by 

Since  these  four  possibilities  may  occur  with  each  of  the  ?i  couplets,  we  shall  have 
when  two  perfect  heterozygotes  cross,  4"  resulting  possibilities.  These  form  the 
I'esulting  population  of  the  second  generation.  Our  first  problem  will  be  to  find  the 
distribution  of  this  population.  This,  according  to  Mendel,  is  the  segregating 
generation.  We  must  inquire  into  the  frequency  of  each  constitutional  difference  in 
this  segregating  generation. 

We  now  reach  our  second  limiting  hypothesis,  which  is  needful  if  we  are  to  apply 
our  tlieory  to  sexual  reproduction.    We  suppose  : — 

There  to  be  an  absence  of  homogamy  (including  self-fertilisation),  and  the  members 
of  the  second  generation  to  cross  absolutely  at  random  and  with  equal  fertility.* 
We  have  then  to  ask  what  is  the  distribution  of  constitutional  differences  in  tlie 
third  generation.  Does  tlie  process  of  segregation  begun  in  the  second  generation 
continue  to  the  third,  or  does  the  population  now  remain  stable  ?  Is  the  continual 
segregation  into  pure  protogenic  and  allogenic  individuals  a  necessary  result  of  any 
pure  gamete  theory,  or  does  the  belief  in  such  necessity  depend  upon  the  first 
Mendelian  experimenters  working  only  with  self-fertilising  individuals  ? 

(3.)  Problem  I. — To  find  the  Distribution  of  the  Offspring  of  the  Perfect 

Heterozygotes. 

We  shall  here  use  a  symbolic  form  of  analysis.  Let  n  stand  for  aa,  v  for  aA,  and 
w  for  AA  ;  then  any  corresponding  couplets  will  give  rise  to 

u  +  2v  4-  w, 

and  any  one  of  these  constitutions  may  be  associated  with  one  of  the  similar 
constitutions  in  any  of  the  remaining  n  —  I  couplets.  Hence  the  general  distribution 
of  the  population  will  be  given  by  the  terms  of  the  multinomial 

{u  +  2y  +  in)". 

This  equalst 

a"  +  nu"-^  (2y  -f  iv)  +  '^^^^  ~     u''--  {2v  +  ivf  +  .  .  .  +  c,, „ v,*^'"*  {'2v  +  ivY  +  . .  . 

1 .  2 

*  If  one  is  to  stady  heredity  in  populations  with  a  view  to  the  proUeni  of  evolution,  the  conditions  as 
to  fertilisation  should  approach  as  far  as  possible  the  conditions  we  suppose  them  to  be  under  in  a 
natural  state;  we  must  fix  our  attention  on  the  nutM  relations  lietweeii  successive  generations  of  the 
population. 

t  Throughout  this  memoir  the  symbol  c^,  p, is  used  for  the  expi'ession  |_ft  /  { 7  \p  \g]- 
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Thus,  for  example,  there  would  be  out  of  the  total  population  of  possibilities  4" : 
1  purely  allogenic  individual,  n  X  3  individuals  with  n—1  allogenic  couplets ;  2n  of 
these  would  have  one  heterogenic  couplet,  and  n  would  have  one  protogenic 
couplet. 

Generally  there  would  be  3*  c„,,,,„  individuals  with  n  —  s  allogenic  couplets,  and 
these  hidividuals  would  be  distributed  according  to  the  terms  of  the  binomial 
{2v  +  ivy. 

We  are  thus  able  to  write  down  at  once  the  number  of  any  class  of  individual  that 
can  appear  in  the  segregating  generation.  For  example,  how  often  do  individuals 
like  u"-p-HiPw'i  appear,  i.e.,  individuals  with  n  —  p  —  q  allogenic,  heterogenic,  and 
q  protogenic  couplets  ? 

To  answer  this  problem  all  we  have  to  do  is  to  pick  out  the  coefficient  of  i<]'-p^HPw1 
in  the  above  multinomial,  and  the  result  is 

We  are  thus  fully  able  to  predict  how  many  individuals  of  each  kind  ought  to 
occur  when  a  population  of  perfect  ?i-couplet  hetero25ygotes  are  crossed. 

Coi-ollary  (i.). — Let  us  consider  only  the  number  of  allogenic  couplets  in  the 
distribution  of  the  segregating  generation.  If  we  were  "  pure  Mendeliaus  "  we  should 
for  the  purpose  of  character  classification  make  v  =  iv,  as  the  heterogenic  couplet 
would  then  give  the  dominant  character.  But  without  doing  this  we  can  assume 
V  and  IV  to  be  non-zt's. 

Hence  the  distribution  of  allogenic  characters  in  the  population  follows  the  simple 
binomial 

^"  (i  +  f 

Thus  we  see  that  the  distribution  would  be  a  skew  binomial  closely  approximating 
to  my  skew  curve  of  Type  III.,*  and  becoming  indefinitely  close  to  a  normal 
distribution  of  the  form 

y  =  y,<i-''"'\ 

\vhen  the  number  of  couplets,  n,  is  indefinitely  increased. 

For  any  value  of  n  the  mean  of  the  skew  binomial  as  measured  by  the  formula  of 
the  memoir  on  "  Skew  Variation,"  f 

=  ^  +  1  -  (1  +  h)  =  hh 

atld  the  standard  deviation  =  \/ -^0^. 

Thus  the  mean  number  of  allogenic  couplets  in  the  members  of  the  segregating 
g'enferation  is  ^  c-f  the  total  number  of  couplets. 

*  'Phil.  Trans.,'  A,  vbl.  18G,  p.  373: 
t  Ibul.,  p.  346; 
tOL.  OCIll. — A:  I 
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Corollary  (ii.). — The  distribution  of  heterogenic  couplets  in  the  segregating 
generation  is  given  by  the  symmetrical  binomial 

4«  (I  +  |)«. 

The  mean  number  is  therefore  ^n,  and  the  standard  deviation  =  \/^n. 

This  is  a  symmetrical  binomial,  and  approaches  extremely  closely,  even  for  a  fairly 
small  value  of  n,  to  the  normal  curve.  We  see  that  if  any  character  depends  solely 
upon  heterogenic  couplets,  the  distribution  will  be  nearly  normal,  and  the  variability 
slightly  greater  than  one  depending  on  the  allogenic  (or,  of  course,  the  protogenic) 
couplets  only. 

To  sum  ujD,  then,  so  far  as  the  distribution  of  characters  depending  upon  allogenic 
or  heterogenic  couplets  goes,  we  may  say  that  a  generalised  theory  of  the  pure 
gamete  leads  us  to  those  normal  and  'skew  distributions  of  frequency  with  which 
biometric  studies  of  variability  have  made  us  already  familiar.  It  would  not  be 
possible  to  base  a  crucial  experiment  on  the  existence  or  non-existence  of  such 
frequency  distributions.  The  generalised  pure  gamete  theory  would,  however, 
account  for  their  appearance,  which,  of  course,  a  purely  descriptive  statistical  theory 
cannot  do.  On  the  other  hand,  distributions  diverging  much  beyond  the  errors  of 
random  sampling  from  binomials  of  the  above  types  would  tell  pro  tatno  against  the 
pure  gamete  theory  in  its  above  form.  The  presence  of  binomials  of  two  types  only, 
(I"  "1"  ^^^d  {\  +  f ought  to  be  capable  of  detection,  even  if  it  would  not  already 
have  been  discovered,  had  it  been  the  rule. 

(4.)  Proposition  II. — To  determine  the  Distribution  of  the  Offspring  of  the 
Segregating  Gejieration,  supposing  them  to  Mate  at  Raridom  and  ivithout 
Differential  Fertility. 

The  solution  of  this  problem  may  be  reached  as  follows  : — 

Suppose  P  any  male,  and  Q  any  female,  say  each  of  n  —  1  couplets,  producing  an 
array  of  offspring,  which  we  will  denote  by  E, ;  now  suppose  an  additional  couplet,  the 
n"',  added  to  both  male  and  female  zygote.    The  male  may  be  now : 

P  +  a„a,„    or    P  +  a„A„,    or    P  +  A  ua,i,  or    P  -\-  A.,iA.,i  i 

and  the  female  may  be 

Q  +  or    Q  +  Q  + 

A.,ia„,  or    Q     A„A„ ; 

that  is,  we  get  4X4  new  mating  individuals,  with  4X4X4  new  oflfspring 
possibilities. 

Now  consider  the  first  father  P  +  a^a,, ;  the  possibilities  which  arise  from  mating 
him  with  the  four  mothers  are  the  array  K  of  offspring  combined  with  any  one  of  the 
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16  possibilities  8a„««  +  '&ci„K,„  or  this  is  the  same  thing  as  multiplying  the  R  array 
by  the  symbolic  factor  8  [u  +  v)  =  16U,  say. 

The  next  pair  of  fathers  2  (P  +  a,iA„)  with  the  four  mothers  reproduce  the  array  R 
of  offspring  combined  with  8  +  2a„A„  +  A„A„),  or  32  jDossibilities.  But  this  is 
the  same  as  multiplying  the  B-array  by  the  symbolic  factor  8  (u  -\-  2v  iv)  =  32V, 
say.  Lastly,  the  P  +  A„A„  father  with  the  four  mothers  gives  16  possibilities  of  the 
form  8a,,A;,  +  8A„A„  to  be  combined  with  the  B-array  of  offspring,  which  is  the  same 
thing  as  multiplying  the  B-array  by  the  symbolic  factor  8     +  iv)  =  16W,  say. 

We  have  at  once  the  symbolic  relation  among  the  operators  : 

U  +  2Y  +  W  =  u  -{- 2v  +  w; 

and,  further,  the  important  result  that  the  array  of  offspring  due  to  any  pair  P  and  Q 

of  n  —  1 -couplet  parents  can  be  converted  into  the  arrays  of  offspring  due  to  the  16 
pairs  of  parents  formed  by  adding  an  additional  couplet  to  P  and  Q,  by  multiplying 
that  array  by  the  symbolic  factor 

16U  H-  32V  H-  16W  =  16  {ii  +  2v  +  tv). 

We  have  thus  by  induction  a  means  of  finding  the  array  of  offspring  due  to  a 
population  of  parents  of  n  couplets  from  the  series  of  arrays  due  to  a  population  of 
n  —  1  couplets.  Since  all  the  arrays  are  to  be  multiplied  by  the  same  symbolic 
factor,  we  can  multiply  their  total  by  this  factor.  Or  the  distribution  of  offspring  of 
{ii  —  l)-couplet  parents  being  J,  that  of  n-couplet  parents 

=  16{u  +  2v+  io)J  =  4  X  4  X  4.(ltt  +  >  f  ^w)  J. 

Now  consider  parents  of  one  couplet,  their  distribution  is  given  by  aa  +  2«A  -f  AA, 
and  they  are  to  mate  with  the  same  series,  aa  -f  2aA  -{-  AA. 
But 

aa  X  aa  =  Acta, 
2  (aa  X  2aA)  =  2  {iaa  +  4aA), 
2  (aa  X  AA)  =  2  (4«A), 
2aA  X  2«A  =  iaa  -f  SaA  -f  4AA, 
2  {2a A  X  AA)  =  2  (4aA  +  4AA), 
AA  X  AA  =  4AA, 

Total  =  16aa  +  32aA  -f  16AA. 
=  16{u  +  2v  +  iv)  =  4x  4.x  A  {iu  +  |y  -f  ^iv) 

symbolically. 

I  2 
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Hence,  by  the  above  proposition,  the  distribution  of  offspring  of  parents  of  two 
couplets  is 

4X4X4.  {^u  +  |y  +  Iw)  X  4  X  4  X  4  .  (i«  +      +  {iv) 
=  43  X  42  X  4- .  (in  +  |v  +  \iv)\ 

and,  by  induction,  the  distribution  of  offspring  for  the  random  mating  of  parents  of 
n  couplets  is 

4"  X  4"  X  4" .  (\u  +  f  V  +  iiv)". 

Tliis,  except  for  the  constant  factor  4"  X  4",  is  absolutely  identical  with  the 
distribution  of  the  parental  population,  and  accordingly  if  the  next  generation  also 
mates  at  random,  the  mixed  race  will  continue  to  reproduce  itself  w^ithout  change. 
We  therefore  reach  the  following  result : — • 

However  many  couplets  ive  suppose  the  character  under  investigation  to  depend 
upon,  the  offspring  of  the  hybrids — or  the  segregating  generation — if  they  breed  at 
random  inter  se,  ivill  not  segregate  further,  but  continue  to  reproduce  themselves  in 
the  same  proportions  as  a  stable  population. 

It  is  thus  clear  that  the  apparent  want  of  stability  in  a  Mendelian  population,  the 
continued  segregation  and  ultimate  disappearance  of  the  heterozygotes,  is  solely  a 
result  of  self-fertilisation ;  with  random  cross  fertilisation  there  is  no  disappearance 
of  any  class  whatever  in  the  offspring  of  the  hybrids,  but  each  class  continues  to  be 
reproduced  in  the  same  proportions.  Thus  our  generalised  theory  lends  no  countenance 
to  the  appearance  of  any  "  mutations "  -within  a  hybrid  population  under  random 
mating ;  the  only  appearance  of  new  constitutions  is  in  the  segregating  generation,  or 
the  first  generation  of  hybrid  offspring.  Except  at  this  stage,  the  appearance  of  the 
unfamiliar  is  only  the  chance  occurrence  of  a  very  rare  normal  variation.  When  we 
recollect  that  a  purely  allogenic  individual  is  only  to  be  expected  once  in  a  population 
of  4"  individuals,  or  if  there  be  ten  couplets,  once  in  more  than  a  million  individuals, 
it  will  be  clearly  seen  that  the  rarity  of  some  of  the  more  exceptional  normal 
constitutions  may  easily  lead  to  their  being  looked  upon  as  "  mutations,"  even  if  they 
appear  in  the  offspring  of  a  population  many  generations  removed  from  hybridisation. 

(5.)  Proposition  III. — To  find  the  Array  of  Offspring  due  to  a  Parent  of  given 
Gametic  Constitution  mating  at  Random. 

This  can  be  again  deduced  by  the  method  of  induction  adopted  in  the  last 
proposition. 

Supposing  a  male  P  of  n  —  1  couplets  to  mate  with  all  possible  females,  and  R„_i 
to  be  the  array  of  ofiPspring,  then  we  have  seen  in  the  last  proposition  that  if  we  add 
an  71^'^  couplet  to  P,  the  array  of  offspring  due  to  P  +  a^a,,  will  be  lGUR„_i  ;  if 
we  add  a  couplet  a,±,„  the  array  of  offspring  due  to  fathers  of  type  P  +  a,,K,,  will  be 
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16VR,_],  and  if  we  add  a  couplet  of  form  A„A„,  the  array  will  be  of  the  form  8WPt„_i. 
Now  start  with  a  f;\ther  of  one  couplet ;  this  must  he  a^a^,  or  a-^A^,  or  Aj  A^,  or  in  our 
symbolic  notation  n,  v,  or  w  ;  the  oft^^pring  array  are  respectively  Sa^a^  +  8a^A^  or 
4a^a^  +  SrYjAi  +  4A^Ai,  or  ^a^A^  +  SAiAj,  i.e.,  IGU,  16V,  or  IGW.  These,  therefore, 
are  tlie  possll)le  values  of  Ji^.  Hence,  l)y  tlie  principle  just  developed  above,  the 
array  of  offspring  due  to  a  father  of  type 

U'^'P'J  VP 

is 

{16JJ)"-P-J  {UYy  (IGW)^ 

or  remembering  that  such  fathers  occur  with  a  frequency  of  2Pc„^p  q,  we  have  for  the 
total  distribution  of  offspriiig  of  all  fathers  of  type 

or 

the  symbolic  result 

4"  X  4".  c,_^,,,,U"-^~?(2V)''W?. 

Substituting,  the  following  expression  v/ould  give  all  offspring  of  fathers  of  the 
type  u^'P'^v^ivi,  i.e.,  with  n—p~q  allogenic,  p  heterogenic,  and  q  protogenic 
couplets 

4"  X  4" .  c„,p,,       +  ^vy-p-i  (1?^  +  V  +         {^v  + 

Therefore,  given  n  and  given  p  and  it  is  merely  a  matter  of  expansion  to  find  the 
array  of  offspring  due  to  any  special  class  of  father. 

Corollary  (i.). — So  far  we  have  supposed  our  special  class  of  father  to  be  defined  by 
the  exact  couplet  distribution  constitutional  to  him.  But  it  is  of  interest  to  consider 
the  array  of  offspring  we  get  supposing  only  the  allogenic  couplets  fixed  in  number, 
for  example,  in  a  generalised  Mendelian  theory  if  the  number  of  recessive  couplets  be 
fixed,  but  the  heterogenic  and  dominant,  as  both  exhibiting  dominant  characters,  be 
considered  as  indifferent.  Let  s  =  number  of  allogenic  couplets,  then  we  have  to 
sum  all  arrays  like 

4"  X  4" .  c„,;,,U^(2V)^W?, 

subject  to  the  condition  that  2)      q  =  n  —  s. 
The  result  is  clearly 

4"  X  4^c,,,,„S{v,,^,,(2V)?W^} 

=  4"  X  4" .  c„/,,„u^  (2V  +  wy-' 

=  4«  X  4" .  c„, ,  {^u  +  1^')'  ih''  +  f V  +  w))"-* 
=  4"  X  4»  .  c,,  ,  „  (Iw  +  ^vy  {(Im  +  h')  -^{v  +  10^"-'. 
This,  we  note,  is  not  a  pure  binomial,  or  the  arrays  of  offspring  of  a  father  with  a 
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given  allogenic  constitution  are  not  either  symmetrical  or  skew  binomials,  but  of  a 
much  more  complex  character.  The  only  exception  is  the  array  of  offspring  of  pure 
allogenic  fathers,*  which  is  given  by 

4"  X  4"  X  ilu  -f  Iv)". 

This  is  a  symmetrical  binomial.  This  result  is,  of  course,  of  special  interest,  for  it 
gives  us  the  distribution  of  offspring  if  the  hybrid  offspring  were  at  any  time  crossed 
with  the  pure  allogenic  race,  which  was  one  of  the  original  factors  of  the  hybridisation. 
The  deviation  from  binomial  distribution  in  the  above  arrays  ought  to  be  further 
considered,  for  if  this  deviation  should  turn  out  to  be  very  significant,  it  would  form  a 
convenient  test  for  any  generalised  theory  of  pure  gametes. 

Corollary  (ii.). — If  we  sum  the  above  expressions  for  the  array  of  offspring  of  all 
fathers  of  p  allogenic  couplets  for  values  of  s  from  o  to  n,  we  have  the  total  offspring 
pojDulation 

=  4"  X  4" .  Sc„, (2V  +  W)"-^ 
=  4"  X  4"  X  (U  +  2V  +  W)" 
_  4«  X  4«         +  2v  +  iv)\ 

a  result  we  have  already  found  in  Proposition  II.  as  giving  the  distribution  of  the 
total  offspring  population. 

(6.)  PiiorosiTiON  IV. — To  find  the  Mean  Number  of  Allogenic  Couplets  in  the 
Offspring  of  all  Fathers  having  in  their  Constitution  s-allogenic  Couplets. 

By  the  first  corollary  to  the  last  proposition  the  distribution  of  such  offspring  is 
given  by 

4"  X  4«  .  c„, „  (1m  +  Ivy  {^u  +      +  ^}"-', 

where  t]  is  written  for  ^  \v  +  iv),  a  quantity  which  is  unity  so  long  as  we  consider  not 
the  distribution,  hut  the  total  number  of  the  non-allogenic  couplets.  Now  this  is 
clearly  the  sum  of  a  number  of  symmetrical  binomials  in  ^u  -f  i'^,  and  may  be  put 

=  4"  X  4" . '  i  'c„,,,i  (|w  +  \vY-'  {2-q)\ 

Now  the  means  of  each  of  these  binomials  can  be  found  from  the  general  theory  of 
the  l^inomial.t  If  we  take  our  origin  at  w  +  1  allogenic  couplets,  with  a  frequency 
zero,  the  mean  of  the  first  binomial,  or 

*  Or,  of  course,  the  array  of  sons  from  pure  protogenic  fathers, 
t  'Phil.  Trans.,'  A,  vol.  186,  p.  373, 
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(ij,,    ^t')"  is  at  1  +  h^,  and  its  total  frequency  fi  =  4"  X  4"  X  c,,, „ ; 
the  mean  of  the  second  binomial,  or 

+  2  +  1(7.-1).    .    .    .    f,  =  '^2f,; 

the  mean  of  the  third  binomial,  or 

(Lu+M'-'  •  ■  ^+Un-2).  .  .  .    =         ' " ^ 2^; ; 

the  mean  of  the  {i  +  1)"'  binomial 

(iw + i-i')"  '  .  .  *  + 1 + - .    y.- + 1  =  ^  ^  T72  .  S  . . .  i  

The  total'  frequency  is  accordingly 

=y; +/,+/;  +  ...  =yi(i +  2)-3 

=  4"  X  4"c„,,,<,3'''-^ 

Hence  if  be  the  distance  from  the  same  origin  of  the  mean  of  the  above 
system  of  binomials 

X  3-^  X      =/,(l  +      +/;(2  +  ^l^j        +  +  1  +  '-^^^  +  ... 

=     { 1  +  ^  +  2 -  .)  (2  +  ^)  +  2^^  ) ^3  ^  n - 2^ 

+     +  2'  (n-g)  ^-+1)  r  7j^-  i\  I 

1.2.3  ...I  \  ^    ^     2    /  J' 

(1  +  2xY-^  =  |l  +  2(71  -        +  2- ^^'^      ^^^^        -  l)^;^        j  . 


Now 


Multiply  by  differentiate  both  sides  and  divide  by  2,  finally  putting  x  =  I,  and 
we  find 

3«-^  +  (n  -  s)3"-*-i  =  1  +  2(7i  -  s)f  +  2^     ~  ^n^^  ~    -  1)| 

1.2 

2'  (^^  -     (^^  -  s  -  1)  ••■  (n  -  g  -  ^  +  1)  ^-f2  „ 
1.2.3...*  2     ^  • 

Hence  we  deduce 


or 


/i  X  3"-'m,  =/i  (3"-*  +  {n  —  s)  S"-'"^  +  in3"-^}, 
=  1  +      +  i  (u  —  s). 
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But  the  mean  of  the  whole  population  of  offspring  is  at  1  +  I''-  ^lom  our  origin. 
Thus  we  have  the  final  results  : 

Mean  number  of  allogenic  couplets  in  offspring  of  fathers  with  s  allogenic  couplets 

=       —  ^{u  —  s)  allogenic  couplets. 

Deviation  from  mean  of  general  population  of  this  array  of  offspring 

=  lu  —  i  {n  -  s)  =  i\  (4.S  -  n). 

Deviation  of  fathers  from  mean  of  population 

=  s  —  i?^  =  ^  (4s  —  n). 

Thus 

Deviation  of  offspring  from  mean  of  population   ^ 

Deviation  of  fathers  from  mean  of  population  ^' 

We  have  then  the  following  results,  which  could  certainly  not  have  been  fore- 
seen  : — ■ 

(a.)  The  regression  is  constant  for  all  arrays,  or  the  regression  curve  is  a  straight 
line. 

(b.)  The  slope  of  this  straight  line  is  -3-,  or,  since  we  have  seen  that  the  })o})ulation 
is  stable,  the  parental  correlation  is  -3  also. 

Now  these  results  are  of  very  singular  importance.  A  very  general  theory  of  the 
pure  gamete  type  leads  to  linearity  of  the  regression  curve,  a  result  amply  verified 
by  observations  on  inheritance  in  populations  ;*  and  this  result  is  quite  independent 
of  the  number  of  couplets  sujjposed  to  form  the  total  character  of  the  parent,  or  of 
the  fact  that  in  this  case  the  arrays  of  offspring  are  skew  and  do  not  obey  the  normal 
law.t  Further,  the  value  of  the  correlation  reached  is  numerically  identical  with  the 
value  obtained  by  Francis  Galton  in  his  original  investigations  on  the  inheritance 
of  stature  !  The  generalised  theory  of  the  pure  gamete  is  thus  shown,  whatever  the 
number  of  couplets  taken,  to  lead  to  precisely  the  chief  results  already  obtained  by  those 
who  have  studied  heredity  statistically.  So  far  then  it  might  appear  that  a 
generalised  theory  of  the  pure  gamete  was  capable  of  being  brought  into  accordance 
with  the  chief  results  of  biometric  experience  in  heredity.  This  would  undoubtedly 
be  a  great  step  forward,  as  linking  up  perfectly  definite  inheritance  results  with  a 
physiological  theory  of  heredity.  Unfortunately  the  whole  drift  of  recent  biometric 
observations  on  heredity  emphasises  three  points : 

First. — That  the  parental  correlation  appears  to  be  markedly  greater  than  ^,  nearer 
to  "45  to  '5. 

♦  Galton,  '  Natural  Inheritance,*  p.  96  ;  '  Biometrika,'  vol.  '2,  pp.  216  and  362-3. 
t  This  is  further  demonstration  that  linearity  of  regression  has  nothing  whatever  to  do  with  the  Gauss- 
Laplacian  law  of  errors,  i.e.,  normal  curves  or  surfaces. 
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Secondly. — That  this  correlation  appears  to  vary  slightly   from   character  to 
character. 

Thirdly.— it  does  not  appear  to  be  absolutely  the  same  for  all  species. 

It  is  most  unfortunate  for  this  general  theory  of  the  pure  gamete,  tliat  it  throws 
the  Mendelian  back  into  the  position  of  the  biometrician  of  1885."*  One  might  have 
hoped  that  the  generality  involved  in  n  couplets  would  have  led  to  the  requisite 
elasticity,  or,  failing  this,  to  a  numerical  value  of  parental  correlation  nearer  the 
cluster  point  of  existing  measurements  than  \.  We  can  only  say,  at  present,  that  a 
generalised  theory  of  the  pure  gamete  leads  to  precisely  the  same  general  features  of 
]'egression  as  have  been  observed  by  the  biometricians,  but  it  appears  numerically  too 
narrow  to  describe  the  observed  facts. 


(7.)  Proposition  V. — To  find,  the  Standard  Deviation  of  the  Array  of  Offsj^ring 
due  to  Fathers  with  s~allogenic  Couplets. 

We  have  to  find  the  standard  deviation  cr,  of  the  combination  of  binomials  dealt 
with  in  the  previous  proposition.  Each  component  standard  deviation  must,  of 
course,  be  weighted  with  the  total  frequency  of  the  component,  and  there  must  be 
the  proper  reduction  to  the  mean  of  the  array  as  a  whole. 

The  [i  +  1)"^  binomial  {^u  +  |r')''"'  has  V  {n  —  i)^X^\  for  its  standard  deviation, 
and  the  distance  of  its  mean  from  the  mean  of  the  array 


=  {n  +  1  -     +  1  +  1  (n  -  i))}  -{^n-\  {n  -  s)} 

Further,  the  frequency  of  this  component  is 
We  thus  see  that  it  contributes 

to  the  total  second  moment  about  the  mean  of  the  array.    This  gives  us 
/,  X  3»-  X  cr/  =/,2o,,.,,„  2.-  {»-^'  +  (1^-  '  -  ij], 

*  "  Galton's  law  makes  the  amount  of  inheritance  an  absolute  constant  for  each  pair  of  relatives.  It 
would  thus  appear  not  to  be  a  character  of  race  or  species,  or  one  capable  of  modification  by  natural 
selection."  More  ample  statistical  experience  of  populations  since  1885  shows  that  absolute  constancy  of 
the  heredity  coefficients  is  not  consonant  with  actual  measurements.— '  Roy.  Soc.  Proc.,'  vol.  62,  p.  411. 

t  'Phil.  Trans.,'  A,  vol.  185,  p.  373. 

VOIv.  CCIII, — A,  K 
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or 

Now 

(1  +  2^)"-  =  Sc„_,,,,„(2x)', 

and  by  differentiating 

2  (m  —     (1  +  2xY-'-^  =  tc,^,^i^o2'ix'-\ 
Repeating  the  process 

4  (n  —  s)  (n  —    —  1)  (1  +  2,r)''-'-3  =  ^c,,_,,  „  2'{  (^■  —  1)  x' 


-2 


Hence,  putting  x  =  I,  we  have  the  required  expressions  on  the  right  of  the  above 
result,  or 

Therefore 

o-/  =z      —  i  (?i  —  s)  =  3L  (5w  +  4.s^). 

Now  the  standard  deviation  of  the  whole  population,  as  far  as  allogenic  units  are 
concerned,  is 

Thus 


This  result  is  of  singular  interest.  The  variability  of  an  array  of  offspring 
corresponding  to  a  father  of  given  allogenic  constitution  is  not  independent  of  the 
father,  but  increases  steadily  from  a  minimum  of  \/ -^n,  when  there  are  no  allogenic 
couplets  in  the  father,  to  a  maximum  of  \/^n  when  there  are  only  allogenic  couplets. 
In  other  words,  fixing  our  attention  on  the  same  character,  let  the  offsj)ring  of  the 
hybrids  inter  se  be  crossed  first  with  one  pure  race,  and  then  with  the  second  pure 
race,  i.e.^  first  with  pure  allogenic  and  then  with  pure  protogenic  individuals,  there 
ought  to  be  a  marked  difference  in  the  variability  of  the  resulting  offspring  in  the 
two  cases. 

Corollary  {]_.). — In  the  theory  of  linear  regression  as  apart  from  the  theory  of 
normal  correlation  on  the  basis  of  the  Gauss-Laplacian  distribution,*  if  cr  be  the 
standard  deviation  of  any  character,  and  r  its  correlation  with  a  second  character, 
then 

cr  \/l  —  r"'^ 


*  Yule,  'Journal  of  Royal  Statistical  Society,'  vol.  60,  December,  1897, 
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is  the  mean  standard  deviation  of  all  the  arrays  of  the  first  character  for  a  given  value 
of  the  second.  This  expression  is  no  longer  the  actual  standard  deviation  of  each 
array. 

It  is  of  interest  to  see  that  this  general  law  of  linear  regression  is  verified  in  the 
present  case.  We  have  o-  =  \/ -pg-n  and  r  =  ^.  Hence  if  be  the  mean  standard 
deviation  of  the  arrays,  we  should  expect 

%„i^  —  -iqTI  (1  "~  ^)  —  e^' 


Remembering  the  weight  of  each  array,*  we  have 

4"  X  4«  X  4"  X       =   S  j4"  X  i'%,s,o^^''-'  \ 

=  4"  X  4"  (1  +  3)"  +  4"  X  4"n4''-i  -3^6 
_  4;.     4«  X  4"  X  In, 


whence  2^;/  =  -^n,  as  we  anticipated. 

Corollary  (ii.). — It  is  clear  that  some  arrays  of  offspring  will  be  more,  others  less 
variable  than  the  general  population.  The  standard  deviation  of  an  array  will  be 
equal  to  that  of  the  general  population  when  s  is  found  from 

3^  (5n  +  4s)  =  or       5  =  i^g^. 

Now  the  mean  number  of  allogenic  couplets  in  the  general  population  is  ^-n.  Thus 
the  offspring  array  equally  variable  with  the  general  population  is  at  distance  -^gn 
from  the  mean.    But  o-  for  the  general  population  =  Hence,  if  we  take 

fathers  deviating  from  the  population  mean  by  xZ-^n  X  cr,  we  should  expect  their 
offspring  to  be  equally  variable  with  the  general  population.  Supposing,  therefore, 
the  theory  under  discussion  were  true,  we  should  have  a  means  of  finding,  at  least 
approximately,  the  number  n  of  couplets  corresponding  to  the  character  under 
consideration.  All  we  should  have  to  do  would  be  to  find  the  standard  deviation  of 
each  array  of  offspring  corresponding  to  a  given  father ;  these  standard  deviations 
ought  to  increase  or  decrease  steadily  across  the  table,  their  squares  giving  a  straight 
line  when  plotted.  Smooth  the  results,  interpolate  a  value  equal  to  cr,  and  we  shall 
have  the  character  of  the  father  whose  offspring  are  equally  variable  with  the  general 
population ;  but  the  deviation  of  this  father  from  the  mean  ought  to  be  sZ-^qIi  X  cr. 
Hence,  since  cr  can  be  found,  we  have  at  once  an  approximate  value  of  n.  Approxi- 
mate only,  of  coui'se,  because  our  arrays  are  classed  by  units  of  measurement,  inchesj 

*  The  number  of  olfspring  in  the  array  due  to  the  fathers  with  s-allogenic  couplets  is  found  at  once 
by  putting  u=  v  =  w  in  the  formula  of  Corollary  (i.)  on  p.  61,  and  equals     x  4" .  Cn, «,  o3"~*. 
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centimetres,  &c.,  and  each  such  unit  will  not,  as  a  rule,  represent  one  allogenic 
couplet ;  but  interpolation  ought  to  give  a  result  not  widely  divergent  from  the 
truth. 

The  method  would  of  course  fail  practically  if  rt  were  very  large.  For  example,  if 
n  were  48,  the  deviation  of  the  required  group  of  fathers  would  be  Sor,  and  hence 
such  a  father  would  only  occur  once  in  1000  individuals.  In  a  manageable  population, 
therefore,  we  are  very  unlikely  to  have  enough  such  fathers  to  form  any  reliable 
measure  of  the  variability  of  their  offspring.  At  the  same  time,  the  squares  of  the 
variabilities  of  the  arrays  of  sons  due  to  quite  frequent  fathers  ought  to  give  a 
straight  line,  and  if  this  line  be  determined  properly,  there  should  be  no  difficulty  in 
finding  the  theoretical  position  of  the  above  father,  and  so  finding  7i. 

Many  other  physiological  theories  besides  the  present  might  give  this  peculiarity 
of  the  diminishing  variability  of  the  arrays  of  offspring  as  we  pass  from  one  side  of 
the  correlation  table  to  the  other,  Such  changes  in  variability  are  familiar  to  those 
who  have  had  to  deal  with  skew  correlation.  But,  as  far  as  we  are  aware,  they  have 
iiot  hitherto  been  noticed  in  inheritance  tables.  The  existence  of  this  changing 
Variability  would  not  affect  in  any  way  the  general  theory  of  linear  regression  applied 
to  heredity  in  populations.  It  would,  however,  lead  to  an  immediate  extension  of 
that  theory  consisting  in  the  tabling  of  the  standard  deviations  of  the  arrays. 
Should  the  standard  deviations  of  these  arrays  show  no  bias  towards  a  linear 
distribution,  but  only  the  fluctuation  to  be  expected  from  random  sampling  about 
the  mean  value  a  ^  {I  —  /■-),  A\'e  should  have  a  strong  argument  against  the  present 
general  theory  of  alternative  inheritance.  We  seem  liere,  therefore,  to  have  a  crucial 
test  of  the  validity  of  the  theory,  which  may  be  quite  as  easy  to  apply  as  the  previous 
test  of  the  numerical  value  of  the  parental  correlation. 

Of  course  the  results  now  reached  are  not  consistent  theoretically  with  normal 
correlation  surfaces  with  their  elliptic  contour  lines.  The  fact  that  Mr.  Galton  came 
to  his  elliptic  contours  in  the  first  instance  on  the  basis  of  his  observations,  and  not 
from  any  theory,*  shows  that  they  must  in  the  case  he  was  dealing  with  be  approxi- 
mately correct.  Further,  there  is  no  doubt  that  in  other  statistics  for  characters  in 
man  there  is  within  the  limits  of  random  sam^jling  a  close  approximation  to  normal 
distribution.  It  might  be  hard  to  consider  that  such  a  deviation  as  would  arise  with 
a  continuously  increasing  variability  of  the  arrays  from  one  side  to  the  other  of  the 
table  could  exist  and  escape  notice,  had  we  not  in  physics  had  evidence  that  theory 
has  often  led  to  the  discovery  of  an  obvious  relation  which  time  after  time  must  have 
been  overlooked  by  previous  observers  unprovided  with  the  theoretical  hint  of  what 
to  seek  for.  Hence  while  we  may  say  that  the  parental  correlation  given  by  the 
theory  is  too  rigid  for  the  facts,  we  must  leave  this  second  test  until  more  care  ful 
examination  ad  hoc  has  been  made  of  the  ample  existing  data. 

•'  ■  *  'Natural  Inheritance,' p.  101. 
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(8.)  Pkoi'OSITION  VI.  —2b  Ji)bd  the.  Array  of  Offspring  due  to  a  G^xtndfather  of 
s-allogenic  Couplets,  supposing  Complete  Random  Mating  in  the  Popidation. 

The  general  distribution  of  the  population  is 

{a  +  2r  +  «')". 

The  fathers  with  y-allogenic  units  in  their  correlation  are  given  by 

The  array  of  oUspring  due  to  these  fathers  is  simply  obtained  by  writing  U  for  u, 
V  for  i\  and  W  for  iv,  and  multiplying  by  4"  X  4".  This  is  a  general  rule  for  getting 
the  offspring  from  any  father  if  he  mates  at  random.  It  gives  us,  as  on  p.  61,  for  the 
offspring  distribution 

4"  X      X  c„,,„,U^(2V  +  W)"-^ 
=  4"  X  4"  X  c„,,„  (in  +        {{^u  +  ^r)  +  {v  +  iv)Y-\ 

To  get  the  offspring  of  this  array  treated  as  fathers  and  mating  at  random,  we 
have  only  to  repeat  the  process,  and  we  find 

(3ffspring  of  grandfather  of  ^-allogenic  couplets 

=  4"  X  4"-  X  4"  X  4"c-,,,„(iU  -f         {W  +  fV  +  W]'-^ 

^  4'i"c„,  „ ,  (f n.  +  iy  +  ^,wy  (f n  +  J/vj  + 

where 

e  -  1^'  +  h^^ 

and  is  eqUai  to  unity  if  we  identify  v  and  w  as  something  not  allogenic.  This  can  be 
dealt  with  exactly  as  in  Proposition  IV.  we  dealt  with  the  array  of  offspring  due  to 
a  father  of  s-allogenic  couplets,  i.e.,  by  analysing  the  array  into  the  sum  of  a  number 
of  weighted  binomials  ;  in  this  case  all  skew. 

Writing  as  before,  'i?  =  i     +  w),  we  have  to  expand 

The  general  term  Is 

c,_,,„(|)V(i^*4-|e)-\ 

This -has  a  total  frequency  c.,,_^^ ^  (  f )'  X  f,  and  its  mean  is  at  a  distance  t+ 1  — i) 
from  the  origin  which  is  taken  at  {n  +  1)  allogenic  couplets. 
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The  total  frequency  of  the  array  is  (1  -^  Hence,  if  m',  be  the  mean  of  the 

grandchildren  measured  from  the  same  origin,  we  have 

/,  X  (f)"-^  X  m',  =f,  [1  +  in  +  i  {n  -  s)  {2  +  ^71  -  I)} 

^^yin-s){n-s-l)  ,3  _^  ^     -  2)}  .  .  . 

i  .  Z  ,  o  .  .  .  * 

=fi  mn  + 1)  X  (f) + u  0^  - iir-'-'}^ 

or 

m',  =  1  +  |;3;|(,,  _  ,)  =  1  +  |n  +  I  {n  -  5). 

Thus 

Mean  of  grandchildren  =       —  i     —  '^■)- 

Deviation  from  general  population  mean  =  -^71  —  ^  (n  —  s)  =      (45  —  n). 
Deviation  of  grandparent  from  general  population  mean  =  6  —  ^71  =  ^  (4s  —  n). 
Hence 

Deviation  of  offspring  _ 
Deviation  of  grandparent  ®' 

This  ratio  is  the  same  whatever  be  the  allogenic  constitution  of  the  grandparent. 

(9.)  Proposition  VH. — To  find  the  Ai^ray  of  Offspring  due  to  an  m^^  Great-grand' 
father  of  s- allogenic  Couplets,  sui^posing  Complete  Random  Mating  in  each 
Generation. 

The  array  due  to  a  father  of  s- allogenic  couplets  is 

4"  X  4"  X  c„,,,,  {1  {u  +  v)  Y  {I  (w  +  r;)  +  (v  +  iv)]"-^ 

and,  as  we  have  already  seen,  we  must  multiply  by  4"  X  4"  and  jjut  \  [u  +  v)  for 
^{u  -\-  2v  -\-  iv)  for  V,  and  -g  (v  +  iv)  for  w  to  get  the  array  due  to  the  grandparent  of 
s  allogenic  units.    This  process  must  be  repeated  w  times  if  we  wish  to  obtain  the 
array  due  to  the  ^n'"  great-grandparent. 

We  must  first  investigate  what  happens  to  ^  {u  4-  v)  if  this  interchange  be  made  m 
times.    Suppose  that  it  has  been  done  i  times,  and  let  the  answer  be 

M,i{u  +  v)  +  W,^{v  +  w). 

Repeat  the  operation,  and  the  expression  becomes 

(|M,  +  IM',)  4  {u  +  v)  +  (IM,  +  f  M',)  i  {v  +  iv), 


INHEEITANCE,  WITH  SPECIAL  REFERENCE  TO  MENDEL'S  LAWS. 


71 


or 

Therefore 

M,^i  +         =  M;  -1-  M',  =  Mo  +  M'o  =  1  +  0  =  1, 

Hence 

M',,!  -  i  =  i  (M';  -  1)  =       (M'o  -  i)  =  -  2^  • 


Hence,  finally, 


M  —  1  -!-  -        M'  —  1  —  ^ 


Thus  the  result  of  m  changes  on  |-  {u  -j-  v)  is  known. 
Similarly  the  result  of  m  changes  on  |-     +    )  is 

'2  2 

We  can  now  write  down  the  array  of  offspring  due  to  an  great-grandparent  of 
s-allogenic  couplets.    It  is 

We  must  now  find  the  mean  of  this  array.    For  brevity  let  us  write 
M„  1  (w  4-  v)  +  W,,  1  (y  +  lu)  =      H-  Xe 

(M,„  +  2M'„0  '^f-^  +  (]Vr„,  +         "LpR  ^  M^^M^,,  ^^^^        _^  ^^^^ 

where 

^  =  iM„,    \     1  -  iM^, 
1         ,  M',„ 


l-fM'/'^l+M^ 

^  2  (M.^  -  WJ)  ^  2  (M,„  -  M^.) 

+  2M',  1  +  M'„,  ' 

7)  =  ^{V  +  W) 
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Hence  we  have  to  find  the  mean  of  the  system 

{ixu  +  key  {fjLU  +  Xe  +  VT))"-', 
The      component  binomial  of  tliis  sum  of  binomials  is 

c,n_,;,,i.V(i^w  +  Xe)''-'. 
It  therefore  has  its  mean  at  a  distance 

+  1  +  X  {n  -  r) 
from  {n      1)  allogenic  couplets,  and  a  frequency  given  by 

The  total  frequency  of  the  whole  array  =  (I  -f-  v)"~\fi. 

Hence,  taking  moments  round  the  origin  at  n  +  1  allogenic  couplets,  we  have,  if 
m',  be  the  mean  of  the  array, 

(1  +  ^y-'A  X  m[..  =/,  1 1  +  hi  +  i^{n-  s)  (2  +  X  {n  -  1)) 
+  ^3  (3  +  X  (?i  -  2))  +  . . . 

.    i(n  —  s)(n  —  s  —  l)...(n  —  s  ~  i  4-  1)       ■    •  ,  x/        w  ,  1 

+  1  "2  V   7   (I  +  ^  +  ^(»^  -  *))  + 

Summing  and  dividing  by  {1  +  vy~'  we  find 

,n'^  =  1  +  Xn  +  in  -^^  H^^A) . 

Hence  the  mean  number  of  allogenic  couplets  in  the  members  of  the  array 
=  n  -f  1  _  m',  =  n{l  -\)  —  {n  —  s)  (1  —  X). 

Deviation  of  offspring  from  mean  of  general  population 

_  Wij-:X)_    h  _  i_-x 


We  now  substitute  for  v  and  X  in  terms  of  M„,  and  M'^,  and  find 

^(L—J^)  -  1  /M  _  MM  —  i  1 
I  -\-  p    ~  3  \^^->-m      iVi  „,)  —  3  -^^^ 

12  ^^2"" 
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Hence  :  Deviation  of  offspring  =  -  ;  but  the  deviation  of  m'""  great-grand- 

parent =    —  ^n. 
Thus  we  have 

Deviation  of  offspring   _  j  1  _ 

Deviation  of  m"'  great-grandparent  2'" 

This  result  is  inde|)endent  of  .v  and  of 
Thus  we  conchide  : 

(i.)  The  regression  of  offspring  on  any  individual  ancestor  is  linear  ; 
(ii.)  The  correlation  coefficient  is  halved  at  each  stage  in  ancestry ; 
(iii.)  The  result  is  perfectly  independent  of  the  number  of  couplets  introduced  into 
the  formula. 

The  first  two  results  are  very  familiar  to  biometric  workers  in  heredity. 

The  actual  numerical  values  of  tlie  grandparental,  great-grandparental,  great-great- 
grandparental  correlations  are  ^,  -^.^^  i/j,  &c. 

These  are  distinctly  less  than  the  values  so  far  reached  for  ancestral  correlation, 
the  grandparental  correlations,  for  instance,  lying  between  "2  and  "3. 

The  results  show,  however,  that  a  general  theory  of  the  pure  gamete,  embracing 
the  simpler  forms  of  the  Mendelian  principle,  leads  us  directly  to  a  series  of  ancestral 
correlations  decreasing  in  a  geometrical  progression.  Thus,  when  we  suppose  a 
population  arising  from  hybridisation  to  cross  at  random,  we  find  that  it  obeys  the 
second  fundamental  assumption  of  the  biometric  theory  of  heredity.  In  other  words, 
ancestry  is  of  the  utmost  importance,  and  the  population  follows  laws  identical  in 
form  with  those  propounded  in  the  biometrical  theory  on  the  basis  of  a  linear 
regression  multiple  correlation.  Only  the  values  of  the  constants  deduced  for  the 
law  of  ancestral  heredity  from  the  present  theory  of  the  pure  gamete  (which  appears 
to  cover  the  bulk  of  Mendelian  formulae  hitherto  propounded)  are  sensibly  too  small 
to  satisfy  the  best  recent  observations  on  inheritance. 

It  is  of  interest  to  find  "Mendelian  Principles"  when  given  a  wide  analytical 
expression  leading  up  to  the  very  laws  of  linear  regression,  of  distribution  of 
frequency,  and  of  ancestral  inheritance  in  populations,  which  have  been  called  into 
question  as  exhibiting  only  a  blurred  and  confused  picture  of  what  actually  takes  place. 

It  would  be  an  immense  advantage  if  we  could  accept  such  a  theory  of  the  pure 
gamete  as  has  been  here  analysed  as  a  physiological  basis  for  the  theory  of  heredity. 
We  should  then  have  a  physiological  origin  for  the  ideas  of  regression  and  of  ancestral 
inheritance  which  statistics  of  heredity  in  populations  have  made  familiar  to 
biometric  workers.  Unfortunately,  even  such  a  general  pure  gamete  theory  as  we 
have  here  dealt  v/ith,  while  leading  to  results  which  form  a  special  case  of  the  law  of 
ancestral  heredity,  is  not  sufficiently  elastic  to  cover  the  observed  facts.    The  lesson 

*  '  Biometrika,'  vol.  2,  p.  220. 
VOL.   CCIIL  A.  L 
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to  be  learnt  from  the  present  investigation  is,  however,  that  there  is  no  essential 
repugnance  between  any  of  the  main  results  of  the  biometric  school  and  a  theory  of 
the  pure  gamete,  but,  on  the  contrary,  it  is  perfectly  possible  to  test  such  theories  by 
biometric  methods.  We  may  fairly  ask  anyone  wlio  propounds  in  future  a  Mendelian 
or  pure  gamete  formula  as  a  general  theory  of  heredity,  to  remember  that  it  involves 
in  itself  definite  laws  regulating  the  reproduction  of  a  population  mating  at  random, 
and  that  it  is  incumbent  on  the  propounder  to  test  whether  or  not  such  laAvs  are 
consistent  with  what  we  already  know  of  the  inlieritance  statistics  of  such  popula- 
tions. When  we  remember  that  deducing  all  the  effects  of  such  a  formula  within  the 
whole  field  of  inheritance  will  almost  always  form  a  very  laborious  piece  of 
mathematical  analysis,  there  seems  a  touch  of  scientific  irresponsibility  in  jjropounding 
an  immense  variety  of  formulae  to  suit  one  or  other  special  case,  and  the  modifying  or 
withdrawing  them  when  they  are  found  to  fail  in  another. 

(10.)  Proposition  VIII. — To  find  the  Regression  and  Correlation  of  Brethren 
on  the  Theory  developed  in  this  Paper. 

We  shall  suppose  the  group  of  brethren  to  consist  of  members,  or  any  pair  of 
parents  to  have  a  family  of  4;^. 

Consider  first  parents  of  one  couplet  only,  the  otfsj^ring  of  the  16  possible  pairs  are 
given  in  the  table  below  : — 


Father. 

a  +  a. 

ft  + A. 

A  +  «. 

A  +  A. 

f  a  +  a 
^  1  «.  +  A 
1  1  A  +  «, 

La  +  a 

4xy 

2  X  + 
x(«  +  2y  +  zc) 
X  {H,  +  2r  +  -tv) 

2x{o  +  w) 

2x{v+i>) 
x(«  +  2/'  +  ;(') 
X{i'-  +  2v  +  a-) 

2x{io  +  ii) 

4xy 

2x  (';  +  «•) 
2x  {iv  +  v) 
4:xw 

NoAv  let  us  take  every  pair  of  brothers  out  of  each  of  these  16  families  and  form  a 
correlation  table  of  brothers.  The  following  table  gives  the  distribution  of  the 
various  types  : — 


First  brother. 

V. 

w. 

Second 
brother 

4x(9x-4) 

24r 
4x2 

24x- 
80x^-32x 
24x2 

4x2 
24x- 
4x(9x-4) 
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This  gives  a  total  of  256x'  -  G4x=  16  X  4x  (4x  -  1)  pairs  of  brothers,  as  it  should, 
every  brother  in  16  families  having  4x  -  1  brethren,  and  there  being  4x  in  each 
family.    Clearly  we  can  divide  by  4x,  and  we  have  the  simplified  form  : — 


«1. 

W-2 

9x-4 
6x 
X 

6x 
20X-8 

6X 

X 
6x 
9x-4 

where  the  subscripts  1  and  2  refer  to  the  first  and  second  brothers. 

Let  us  simplify  this  by  considering  only  the  allogenic  elements,  t)  denoting  either  a 
heterogenic  or  a  protogenic  element.    Then  we  have  : — ■ 


m 

9x-4 
7x 

7x 
41x-12 

Thus  the  distribution  of  pairs  of  brothers  in  the  case  of  the  character  being  fixed 
by  a  single  couplet  is 

(9x  -  4)  u^u,  +  7x«i^7:2  +  ^X%^i  +  (41x  -  12)  r]^y}.. 

This  is  to  be  read  as  follows  :  there  are  9x  —  4  cases  of  both  brothers  being 
allogenic,  to  4 Ix  —  12  cases  of  neither  brother  allogenic,  to  7x  +  7x  cases  of  one  only 
of  the  two  brothers  being  allogenic. 

Now  when  we  pass  from  a  character  fixed  by  one  couplet  to  a  character  fixed  by 
two,  the  above  distribution  can  occur  in  either  couplet,  and  ev^ery  possible  pair  of 
brothers  will  be  got  by  squaring  the  above  ex^jression.  Proceeding  in  this  way  to 
n  couplet  characters,  we  have  the  following  symbolic  expression  for  the  distribution 
of  brothers 

{^XY  X  [9x  -  ^)u,u,  +  7x%^o  +  7xu.^,  +  (41x  -  12)7^,7^,}\ 

the  omitted  factor  4x  being  restored. 

This,  I  think,  represents  the  distribution  of  a  character  measured  by  allogenic 
couplets  in  a  population  of  pairs  of  brothers,  i.e.,  is  the  correlation  table  for  brothers 
in  the  population — any  term  involving  u/Uc^^  representing  when  we  put  yji  and  173 
equal  to  unity  the  number  of  pairs  in  which  the  first  brother  has  p  and  the  second  q 
allogenic  couplets  in  their  constitutions. 

Hence  if  we  find  the  coeflicient  of  u/  in  terms  of  u.2,  we  shall  have  the  array  oi 

L  2 
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brothers  to  Ije  found  associated  with  a  brother  of  p  allogenic  couplets.  The  above 
expression  may  be  written 

(4x)"  [((9X  -  4)  n,  +  ^xl^  \  «i  +  {7x%  +  (41x  -  12)  771I  ^J". 
The  term  involving  u-^'  is 

W  {(9X  -  4)  u,  +  7xniV  [7x^'3  +  (41x  "  12) 
Neglecting  the  constant  factor,  the  distribution  in      is  given  by 
{(9x  -  4)^,  +  7xmY  {7x^H  +  (41x  -  12)^71]" 


We  require  to  find  the  mean  of  this  array. 
Put 

X=^X-4  ^  =  _^X„,       ,  =  4(5X-1) 

16X-4'  16x-4'  7x 

Then  again,  l)ut  for  a  factor  independent  of  the  power  of  u^,  the  array  may  be 
read 

The  general  term  is  therefore 

and  we  must  sum  from  s  =  0,  to  s  —  n  —  p. 

The  general  term  has  therefore  its  mean  at  the  distance  I  +  /a  (^9  +  ^)  from 
p  +  .s"  +  1  allogenic  couplets,  or  its  mean 

=  (1  —  ^J^)  {p  -\-  s),  allogenic  couplets, 

and  its  total  frequency  =  i'""^"*  c,i_^,_  s,  o- 

This  gives  a  total  frequency  of  the  array  proportional  to  (1  +  vY~^\ 
Hence,  taking  moments,  we  have  for  the  mean  m  of  the  array  given  l)y 

w  X  (1  +  v)'"-^  =         {(1  -  /.)  {p  +  s)  i'''-''-'c,,_,^s,o] 

=  (1  -  l^)p  (1  +  v)"'-^'  +  (1  -  /^)  {n -p)  (1  +  r^r'-'- 


Therefore 


1  —  a     ,  V  (1  —  ix) 

on  =  L-  71  H  ^  p  . 

1  +v     ^     1  +v  ^ 


Hence  we  see  that 

(i.)  The  regression  between  brothers  is  linear. 

(li.)  The  fraternal  correlation  which  is  equal  to  the  regression  = 

3(4x-l)' 
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aud  is  quite  independent  of  the  numbei'  of  couplets.  It  is,  however,  a  function  of  x» 
the  size  of  the  family  used  in  forming  the  table.    We  have  tlie  following  values  : — 


Size  of  family. 

Value  of  X- 

Value  of  fraternal 

8 

X  =  2 

•3333 

12 

3 

•  3636 

16 

4 

•3778 

20 

5 

•3860 

24 

6 

•3913 

32 

8 

•3978 

40 

10 

•4017 

00 

CO 

•4067 

The  value  of  fraternal  correlation  thus  varies  with  the  size  of  the  family  dealt  with 
from  '3  to  "4.  Probably  the  more  correct  way  of  looking  at  any  fraternal  correlation 
table  would  be  to  suppose  it  a  random  sample  of  all  the  pairs  of  brothers  which  would 
be  obtained  by  giving  a  large,  or  even  indefinitely  large,  fertility  to  each  pair,  for 
what  we  actually  do  is  to  take  families  of  varying  size  and  take  as  many  pairs  of 
brethren  as  they  provide.  In  this  case  we  ought  to  reach  a  fraternal  correlation  of  "4, 
precisely  the  value  reached  by  the  ancestral  law  when  we  take  Francis  Galton's 
original  series  of  ancestral  correlations.* 

Thus  we  conclude  that  on  the  general  theory  of  the  pure  gamete  here  dealt  with, 
the  fraternal  correlation  is  slightly  larger  than  the  parental.  This  is  in  accordance 
with  the  general  result  of  biometric  investigations  on  populations.  But  the  value,  as 
in  the  case  of  the  parental  correlation,  is  very  sensibly  lower  than  the  value — about 
"5 — found  from  recent  investigations  on  man.  It  is  further  very  inelastic  even  if  we 
allow  for  some  variation  in  the  size  of  families  dealt  with.  There  can  be  little  doubt 
that  fraternal  correlation  varies  from  character  to  character  and  species  to  species  in 
a  manner  sensibly  beyond  what  can  l^e  accounted  for  by  differences  in  the  size  of  the 
family  dealt  with.j 

Corollary. — We  can  exhibit  the  regression  in  the  form  : 

Mean  of  array  —  mean  of  general  population 


1  -\-v 


{deviation  of  brother  from  mean  of  general  population}, 


by  observing  that  1  — ^  =  1(1  -\-  vjx),  whence 


—  4      T     I  ^    "t  I 


*  '  Roy.  Soc.  Proc.,'  vol.  62,  p.  410. 

t  There  is  sensible  variation  even  for  different  characters,  when  we  take  the  same  series  of  pairs  of 
brothers,  and  only  one  pair  from  each  family. 
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or 


m 


hi  = 


1  +^ 


(  P  -  in). 


(11.)  Pkoposition  IX. — To  find  the  General  Formula  for  BijKirental  Regresftion  on 
the  Theory  of  the  Pure  Gamete,  and  the  Valve  to  he  given  to  the  "  Midparent." 

If  we  applied  without  further  consideration  the  general  formula  for  biparental 
regression  to  this  case,  we  should  have,  if  mp,j  be  the  mean  of  the  offspring  due  to 
fathers  of  jj-allogenic  couplets,  mated  with  mothers  of  fy-allogenic  couplets, 

mj,,j  =  \n  +  i  ( />  -  \n)  -\-  \{q  —  {n). 

This  follows  at  once,  since  the  mean  of  the  general  pojDulation  =  \ri,  the  regression 
coefficient  for  either  parent  =  \,  and  there  is  no  assortative  mating. 
Hence  we  should  have 

"ly^pri^-hn  -\-  \{p  ^-  q). 

Now  suppose  both  parents  of  pure  allogenic  race,  then  p  =  <i  ■=  n,  and  all  the 
offspring  will  be  of  pure  allogenic  race,  or  we  must  have  nij^y  —  n. 
But  the  above  formula  gives 

Avhich  is  not  correct. 

In  other  words,  while  the  above  formula  gives  the  best  plane  to  fit  the  array  of 
points  determined  l)y  the  parental  constitutions,  that  array  of  points  does  not  truly 
lie  in  a  plane.  Or,  although  the  simple  regressions  are  linear,  the  compound 
regression  is  not  truly  planar.  We  have  therefore  to  find  its  true  form,  and  measure 
the  amount  of  deviation  from  the  truth  involved  in  using  a  biparental  formula  of  the 
type  indicated. 

Given  a  character  resulting  from  n  couplets,  we  require  4"  X  4"  individuals,  4"  male 
and  4"  female,  to  form  tlie  whole  possible  system  of  random  matings.  In  such  a 
population  there  would  be 

fv/,/),r/3"~^  fathers  of  p-allogenic  couplets, 

and 

Ch,,/,o^"~'^  mothers  of  (/-allogenic  couplets. 
The  chance  therefore  of  a  mating  of  a  |?-allogenic  father  and  a  (/-allogenic  motlier  is 

r  f  ^iti—p-rjlAii' 

and  when  n  be  even  moderately  large,  this  gets  very  small  if  p  and  q  at  all  nearly 
approach  n.  For  example,  if  n  were  5,  and  father  and  mother  were  both  pure 
allogenes,  the  chance  of  such  a  pure  allogenic  mating  would  be  only 

1/1,048,576, 

or  in  a  population  of  a  million  would  hardly  occur  once. 


INHERITANCE,  WITH  SPECIAL  REFERENCE  TO  MENDEL'S  LAAVS. 


79 


Still  keeping  n  =  5,  take  'p  =  (j  =  =  4.  Then  we  have  the  chance  of  such  a 
mating 

1=  1/4215 

still  extremely  improbable. 

Thus  when  «  is  even  moderately  large,  pure  allogenic  matings  are  so  rare  that  they 
have  vanishingly  small  influence  in  the  population  at  large.  Even  if  n  were  2,  the 
chance  of  a  pure  allogenic  mating  is  only  -^^q.  These  points  nmst  be  borne  in  mind 
in  what  follows. 

Consider  first  a  father  and  a  mother  of  one  couplet  each,  their  zygotes  are  either  u, 
n  or  w,  involving  a  gametic  constitution  a  +  a,  a  +  A  or  A  +  A.  We  have  the 
following  scheme  : — 


Zygote  of  father. 

Zygote  of  mother. 

Number  of  matings. 

Offspring. 

H 

u 

1 

4?f 

n 

V 

2 

2  (//  +  /■) 

It, 

w 

1 

\r 

r 

u 

2 

•2{v+ii) 

r 

n 

4 

11  +  2p+  w 

n 

w 

2 

2  (w  +  v) 

w 

n 

1 

2  (y  +  v) 

IV 

V 

2 

2  (w  +  v) 

v: 

w 

1 

4:W 

Hence  it  there  be 

1  allogenic  couplet  in  father  and  1  in  mother,  offspring  =  Au, 

1  0  r=  4  {u  +  2v), 

0  1  =4  {2v  +  1.1), 

0  0  .'     =  4(?(  +  4i'  +  iw). 

Let  us  write 

16^0     16  {III  +  V  +  lu),        16j,  =  16  {iu  +  iy),        16^2  =  I6^u. 

Then  consider  the  relation 

(lA'  +  it'e')  X  {a'^^rj^>  +  n'^)  =  4^  {j,,^'  +  Ji  i^'v'  +  +  jW^'), 

where  e  and  rj  are  mere  symbols,  and  0,  1,  etc.,  denote  their  powers,  u,  u'  refer 
respectively  to  father  and  mother,  and  their  powers  denote  the  number  of  allogenic 
couplets  in  the  zygotes  of  father  and  mother.  Then  the  above  is  a  symbolical  relation 
which  gives,  by  equating  any  power  or  product  of  e  and  on  either  side,  the  oflPspring 
of  a  pair  of  parents  of  definite  constitution. 

Now  suppose  the  parents  not  to  consist  of  a  single  couplet,  but  of  n  couplets,  then 
the  total  distribution  of  offspring  that  we  have  given  above  for  any  couplet  may  occur 
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in  each  couplet,  and  each  sncli  distribution  must  he  combined  ^vith  every  otlier  couplet 
distribution.  We  then  reach,  dropping  unnecessary  indices,  the  general  symbolic 
relation, 

(vi'^  +  v/e  +  v-e-  +  .  .  .  +  /'"e") 
X  (a''  +  u'^T)  +  //y  +  .  .  .  +  v/''r}") 

=  r  X  4"  X  Uo+jA'  +  v)  +  i:^vY- 

Thus  the  array  ol'  offspring  due  to  parents  of  zygotes  with  and  q  allogenic 
couplets  respectively — i.e.,  to  W'X'iif'' — is  the  coefficient  of  €''17'  on  the  right-hand  side, 
or  in  the  expansion  of 

4"  X  4"  X  (,7o+,/i(e+7;)+,;>77)". 

This  may  be  written 

4"  X  4"  X  {j,  +  ,/\>7)e  +  ./„  + 

Thus  the  coefficient  of  e''  is 

4"  X  4"  X  (ii  +i2r?)"(./o  + 

We  require  to  pick  the  coefficient  of  rj''  out  of  this  in  order  to  get  the  array  ot 
offsjDring  due  to  fathers  of  p,  and  to  mothers  of  q,  allogenic  couplets.  But  this  is 
clearly 

4"   X    4"   X   '-'n,,,,o{j-lJ\'    Vo"  ^'^P,'hO 


+  ...], 

or,  more  briefly, 

4"  X  4"  X  c,^,^,^Eur'jr''''''jr'~'o,,.,,,..,c,^,^_,^,^. 

We  shall  first  find  the  mean  and  frequency  corresponding  to  the  r*''  term  as  given 
above  of  this  series.    What  we  have  to  deal  with  is 

{\u)'"-  (in  +  ^r)'-^^-^'-  {^u  +     +  ivy-'-\ 

We  may  write  this 

3_^-^^_^.  ^  ^^^^  ^  ^  ^^^^  ^      _^  2x')"--", 

where  x  —  "  '^^^^  X  —  i'^'  +  I^^S  ^^^1  both  may  be  put  unity  when  we  are  merely 
finding  the  distribution  of  allogenic  couplets. 

Now  the  general  term  in  the  above  expression  is 


4'* 


n— }>—)',  s,  0) 
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and  6^  must  be  taken  from  o  to  n  —  p  —  r.    The  frequency  of  this  term  is 

0)1-1/  +  )- 

and  its  mean  m  q  —  r  +  ^  {p  —  q  -\-  2r  -\-  s)  allogenic  couplets. 
The  total  frequency  of  the      term  is  therefore 

^  (1  +  2)"-'-'  =  ^—  , 

which  of  course  must  ultimately  be  multiplied  by  the  factorials  in  r  omitted  above. 
If  m,  be  the  mean  number  of  allogenic  couplets  in  the  r*"  term,  we  have 

q3«— p  — 5         «  =  «— p— ?■  /Oii  —  q  +  r 

4  s=o     \  4" 

=  VL-v^\{p-q\ 2r)}  J^\{n-p-  r) 

Thus  : 

wi,,  =     _  7'  +  i  (j^J  -  ^  +  2r)  +  i  (?2  -  p  -  r), 

=  i''^  +  |g  +  |p  -  l^'- 
This  is  the  mean  of  the  r*'^  term,  and  its  total  frequency  is 

gSrt  —p  —  q 

4     X  4     X  Cn,p,r  X  C.p^^_j,^„. 

Hence,  if 

F  =  4"  X  32"-^-^/s  c,,,^,,  X  c,,,_,.  „ 

we  shall  have 

fx  m,,,  =  S  [{^n  +  |g  +  |p)  -  |r]  c„,,,, ,  X  , 

)-=0 

where  ?7i^j^  is  the  mean  of  the  array  of  offspring  due  to  fathers  of  p  and  mothers  of  q 
allogenic  couplets. 

The  only  difficulty  here  is  summing  the  series 

r=q 

But  this  may  be  written 

S  {ll    X    C,i_i^j,  X    C^,,  ry-l-r+l,  0)5 

5'— 1=0 

or  multipled  by  i"-^^^i--^)-i'-{9-i)     it  represents  the  total  offspring  of  fathers  of_p  and 
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mothers  of  q  —  I  allogenic  units  in  a  population  with  [n  —  1)  couplets  in  their 
constitution 

Hence 

,-l=r,-l 

S        [n  X  C,i_i_  X  Cj),  0}   =  W  X  ('n-l,p,o  X  ^n-l,  q-l,  n  - 

7'—  1  =  0 

Now 
Thus 

or,  if  f  denote  the  above  series,  we  have 
This  leads  us  to 


=  i«  +  i(i'  +  <?)  +  i^ 

_  ,  (n  +  2yj)  {n  +  2^) 


This  is  a  most  remarkable  result,  for  it  shows  that  the  regression  surface  is  not  a 
plane  but  a  hyperboloid.  Let  us  measure  all  the  quantities  in  deviations  from  the 
mean  of  the  general  jDopulation,  i.e.,  put  m^j  =  m'pg  +  ^n,  'p  —  p'  -\-  -^n,  q  =  q'  +  |n- 
We  find 

«^Vy  =  i(i>'  +  ?')  +  t^' 


This  formula  reconciles  at  once  the  Mendelian  and  Galtonian  j^ositions.  When  the 
number  of  couplets  is  large,  parents  having  a  number  of  allogenic  couplets  comparable 
with  n  are  vanishingly  small  in  number.  The  standard  deviation  cr  of  the  population 
is  \/  3n/4  (see  p.  57). 

Hence  we  may  write 

/   _  2  />'  +  g'\  ,   P'  . 

'  1"!  —  3  ^  /tt: 


m' 


2     J      ^a-  \/Sn 


But  p'  can  only  once  per  thousand  cases  be  as  big  as  3cr,  and  accordingly  when  n  is 
large,  both  terms  of  the  product  will  be  small.  In  this  case  the  surface  becomes 
practically  planar,  and  we  have  the  Galtonian  result  of  1885,*  that  the  offspring  from 
the  Galtonian  "  midparent"  are  one-third  nearer  the  general  mean  of  the  population. 

On  the  other  hand,  when  n  is  small  we  see  that  for  midparents  not  differing  too 
widely  from  the  population  mean,  Galtonian  regression  of  the  value  ^  holds,  but  that 

*  '  Natural  Inheritance,'  p.  97. 
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as  we  pass  away  from  the  mean  of  the  population  the  regression  gets  less  and  less, 
becoming  absolutely  zero  when  we  take  two  pure  allogenic  parents.  Just  as  the 
regression  is  reduced  when  we  emphasise  the  allogenic  constitution  of  the  parents,  it 
is  increased  when  we  emphasise  the  non-allogenic  elements. 

To  illustrate  these  points  I  have  drawn  a  diagram  for  ri  =  9  of  the  contour  lines  of 


Midparental  contours. 


the  regression  surface  as  a  plane  and  a  hyperboloid.  The  area  ABCD  contains  all 
possible  parents  ;  999  out  of  1000  random  matings  fall  in  the  loop  ahcd  ;  99  out  of 
100  random  mating  within  the  loop  a'h'cW,  which  has  been  carried  right  round  to 
mark  that  it  excludes  matings  in  which  both  parents  have  no  allogenic  couplets.  The 

M  2 
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means  of  the  array  of  ofispring  due  to  any  pair  of  parents  are  marked  in  slim  figures 
along  the  Mendelian  hyperbolic  contours,  and  in  heavy  figures  along  the  Galtonian 
straight  lines.  We  see  comparatively  small  differences  as  long  as  we  deal  with 
matings  within  the  99  per  cent,  loop,  somewhat  greater  differences  as  we  approach 
the  99*9  per  cent,  loop,  and  very  marked  differences  as  we  go  beyond  the  latter 
boundary  towards  pure  allogenic  parentage.  A  study  of  the  diagram  illustrates  at 
once  how  the  Mendelian  theory  exhibits  for  the  bulk  of  the  population  Galtonian 
regression,  such  regression  becoming,  however,  less  and  less  as  we  j^roceed  to 
individuals,  the  frequency  of  whose  matings  may  be  less  than  one  in  a  million. 

It  will  be  seen  again  that  in  tliis  proposition  we  have  no  fundamental  antagonism 
between  the  Mendelian  and  biometric  standpoints.  We  reach  a  single  formula  which 
approaches  more  and  more  closely  to  the  biometric  standpoint  when  we  deal  with 
characters  depending  on  many  allogenic  couplets.*  On  the  other  hand,  it  gives  the 
absence  of  regression  which  is  obtained  when  pure  allogenic  parents  are  mated.  We 
see,  however,  quite  clearly  that  it  is  totally  erroneous  to  argue  from  this  single  case 
against  regression  in  general.  Such  regression  actually  exists  on  "  Mendelian 
Principles  "  when  any  population  breeding  at  random  is  taken,  and  involves  in  itself 
the  whole  conception  of  ancestral  correlation  and  the  influence  of  ancestry. 

Of  the  formula  for  the  midparent  now  reached,  however,  we  can  only  say  that  on 
the  basis  of  our  experience  in  populations  the  factor  -f  seems  too  inelastic  to  work. 
Here  again  the  data  must  be  especially  investigated  from  the  standpoint  of  a 
midparent  given  by 

+  + 

before  judgment  can  be  final  on  this  test. 

Theoretically,  by  assuming  the  midparent  to  be 

hip' +  </)^xv'<l'^ 

we  should  have  a  means  of  finding  ^  by  averages,  and  therefore  n,  the  number  of 
couplets  involved. 


(12.)  Geneml  Conclusions. 

In  this  paper  we  have  dealt  with  a  general  theory  of  the  pure  gamete — possibly 
not  the  widest  that  could  be  conceived — but  sufficiently  wide  to  indicate  the  real 
bearing  of  Mendelian  formuke  when  applied  to  a  population  mating  at  random.  We 
see  that  under  such  circumstances  : 

(i.)  The  population  which  results  from  the  offspring  of  hybrids  remains  stable; 
every  variation  which  appears,  appears  with  a  certain    definite  and  predicable 

*  For  «-  =  50  or  100  the  population  within  the  1  in  1000  line  sensibly  obeys  oi  dinary  linear  midparental 
regression. 
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frequency  ;  there  is  no  room  for  the  ajjpearance  of  "  mutations,"  although  certain 
variations  with  very  small  frequency  would  be  extremely  rare  in  a  limited  population. 
A  mutation — a  variation  not  hitherto  observed — would  only  appear  in  the  offspring 
of  tlie  hybrids  between  two  pure  races ;  after  this  with  random  mating  the  mixed 
race  would  remain  perfectly  stable  until  disturbed  by  sexual  or  natural  selection. 
These  are  the  only  mutations  which  arise  on  the  generalised  theory  of  the  pure 
gamete,  i.e.,  two  pure  races  form  one  mixed  race,  breeding  true  to  itself;  it  is  difficult 
under  these  circumstances  to  account  for  the  origin  of  the  two  pure  races  by  a 
mutation-theory  of  the  differentiation  of  species  ! 

(ii.)  Between  any  two  relations — if  we  measure  the  character  by  the  number  of 
allogenic  or  protogenic  couplets  in  the  zygote  of  the  individual — we  have  a  linear 
regression.  The  frequency  distribution  of  any  character  is  skew,  approaching  closely 
the  normal  distribution  as  the  number  of  couplets  which  determines  the  constitution 
of  the  zygote  is  increased. 

(iii.)  The  correlations  between  pairs  of  blood  relations  take  definite  numerical 
values  absolutely  independent  of  the  number  of  couplets,  and  the  same  for  all 
characters  and  races. 

(iv.)  The  ancestral  correlations  form  a  geometrical  series  of  common  ratio  one-half 
(v.)  Fraternal  correlation  is  fixed  between  narrow  limits  depending  on  the  number 

of  brothers   per   family  dealt  with,  and  is  very  slightly  larger   than  parental 

correlation. 

(vi.)  The  theory  of  the  midparent  for  a  considerable  number  of  couplets  approaches 
closely  that  originally  given  by  Francis  Galton,  except  for  extreme  values  of  the 
character,  when  the  regression  becomes  rapidly  smaller  and  ultimately  vanishes. 

We  thus  see  that  a  generalised  theory  of  the  pure  gamete  would  be  of  very  great 
advantage  if  it  could  be  accepted.  It  would  lead  to  a  system  of  inheritance  in 
randomly  mating  populations  with  non-differential  fertility,  which  in  its  broad 
features  would  be  essentially  the  same  as  that  which  has  been  biometrically 
developed  not  from  theoretical  hypotheses,  but  from  the  statistical  description  of 
observed  facts  in  populations. 

Unfortunately,  however,  when  we  come  to  the  actual  numerical  values  for  the 
coefficients  of  heredity  deducible  from  such  a  theory  of  the  pure  gamete,  they  do  not 
accord  with  observation.  They  diverge  in  two  ways  :  First,  they  give  a  rigid  value 
for  these  coefficients  for  all  races  and  characters — a  result  not  in  reasonable 
accordance  with  observation.  Secondly,  they  give  values  distinctly  too  small,  as 
compared  with  the  average  values,  or  with  the  modal  values  of  large  series  of 
population  observations. 

We  tlius  reach  the  point  we  have  so  often  had  to  insist  upon  :  that  the  biometric 
or  statistical  theory  of  heredity  does  not  involve  a  denial  of  any  physiological  theory 
of  heredity,  but  it  serves  in  itself  to  confirm  or  refute  such  a  theory.  Mendelian 
formula  analytically  developed  for  randomly  mating  populations  are  either  consistent 
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or  not  with  the  biometric  observations  on  such  populations.  If  they  are  consistent,  it 
shows  their  possibihty,  but  does  not  prove  their  necessity.  If  they- are  not,  it  shows 
they  are  inadequate.  The  present  investigation  shows  that  in  the  theory  of  the  pure 
gamete  there  is  nothing  in  essential  opposition  to  the  broad  features  of  linear 
regression,  skew  distribution,  the  geometric  law  of  ancestral  correlation,  etc.,  of  the 
biometric  description  of  inheritance  in  populations.  But  it  does  show  that  the 
generalised  theory  here  dealt  with  is  not  elastic  enough  to  account  for  the  numerical 
values  of  the  constants  of  heredity  hitherto  observed. 

It  will  be  time  enough  to  consider  other  more  or  less  general  Mendelian  formulfB 
when  there  is  far  better  evidence  than  exists  at  j^resent  that  they  cover  a  real  range 
of  observation,  and  have  not  been  solely  invented  to  describe  isolated  experiences,  the 
numerical  results  of  which  are  not  in  complete  accordance  with  simple  Mendelianism. 
Given  such  neo-Mendelian  formulae,  there  is  a  perfectly  straightforward  mathematical 
method  of  applying  them  to  randomly  mating  populations,  but  that  method  is 
excessively  laborious,  and  the  biometrician  may  well  hesitate  to  undertake  the  task 
of  their  investigation.  A  few  minutes  suffice  to  invent  a  Mendelian  formula,  but 
weeks  of  labour  may  be  involved  in  testing  whether  it  leads  to  legitimate  results 
when  applied  to  sexually  crossing  races.  Let  us  therefore  have  a  few  simple  general 
principles  stated  which  embrace  all  the  facts  deducible  from  the  hybridisation 
experiments  of  the  Mendelians  ;  these  can  form  the  basis  of  a  new  mathematical 
investigation,  but  it  is  idle  to  undertake  such  an  investigation  so  long  as  Mendelian 
Principles  remain  in  a  state  of  flux. 

Any  combination  of  the  theory  of  pure  gametes  here  discussed  with  homogamy,  or 
with  fertility  correlated  with  homogamy,  or  again  with  prepotency  of  individual  or  of 
type,  would  emphasise  the  correlations  which  we  have  found  above  to  be  too  low ; 
but  such  hypotheses  would  involve  a  fundamental  alteration  in  the  formula 

{a  +  a')  (A  +  A')  =  wA  +  aA'  +  a' A'  +  AA'. 

Such  a  formula  would  then  give  the  possibilities  of  the  cross,  but  the  proportions  of 
these  possibilities  actually  occurring  would  be  quite  different. 

Such  loading  of  the  possibilities — not  only  of  the  individual  couplet — but  very 
probably  of  associated  couplets  in  the  constitution — might  conceivably  enable  us  to 
deduce  better  values  for  the  ancestral  and  collateral  correlations.  But  it  would 
abolish  not  only  the  simplicity  of  the  fundamental  Mendelian  formula,  it  would  also 
involve  lengthy  preliminary  studies  on  homogamy,  fertility,  and  prepotency  before 
any  effective  formula  could  be  propounded. 

*  Toss  two  pennies,  and  the  result  of  4'n  tossings  will  closely  approximate  to  the  distribution 
«.  (HH  + 2HT  + TT).  Load  one  or  both  coins,  and  the  possible  variations  will  still  1)6  HH,  HT  or  TT,  but 
their  proportions  will  be  far  from  n  :  2n  :  n. 
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